[bionet.molbio.genbank.updates] C. tentans ORF's

GenBank-Updates@genbank.bio.net (05/25/91)

LOCUS       CHIORFHB     6997 bp ds-DNA             INV       25-MAY-1991
DEFINITION  C. tentans ORF's (A-E) for hemoglobin.
ACCESSION   X56272
KEYWORDS    hemoglobin.
SOURCE      Chironomus tentans DNA.
  ORGANISM  Chironomus tentans
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea;
            Chironomidae.
REFERENCE   1  (bases 1 to 6997)
  AUTHORS   Schmidt,E.R.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 6997)
  AUTHORS   Rozynek,P., Broeker,M., Hankeln,T. and Schmidt,E.R.
  TITLE     The primary structure of several hemoglobin genes from the genome
            of Chironomus tentans
  JOURNAL   Unpublished (1991)
  STANDARD  full automatic
COMMENT     *source: library=genomic ten; clone=ten Hb5; **map: chromosome=III;
            map position=A6;
            
            From EMBL 26   entry CTORFHB;  dated 28-JAN-1991.
FEATURES             Location/Qualifiers
     prim_transcript complement(<1063..>1431)
                     /note="possible pseudogene"
                     /gene="ORF B"
     mRNA            complement(<1969..>2451)
                     /function="hemoglobin production"
                     /gene="ORF C"
     prim_transcript complement(<1969..>2451)
                     /function="hemoglobin production"
                     /gene="ORF C"
     mRNA            complement(<4000..>4485)
                     /function="hemoglobin production"
                     /gene="ORF D"
     prim_transcript complement(<4000..>4485)
                     /function="hemoglobin production"
                     /gene="ORF D"
     mRNA            complement(<5154..>5639)
                     /function="hemoglobin production"
                     /gene="ORF E"
     prim_transcript complement(<5154..>5639)
                     /function="hemoglobin production"
                     /gene="ORF E"
     mRNA            <52..>543
                     /function="hemoglobin production"
                     /gene="ORF A"
     prim_transcript <52..>543
                     /function="hemoglobin production"
                     /gene="ORF A"
     CDS             complement(<1063..>1431)
                     /pseudo
                     /gene="ORF B"
                     /codon_start=1431
     sig_peptide     52..99
                     /function="hemoglobin production"
     CDS             52..543
                     /function="hemoglobin production"
                     /gene="ORF A"
                     /codon_start=52
     mat_peptide     100..540
                     /function="hemoglobin production"
                     /gene="ORF A"
     polyA_signal    591..596
     TATA_signal     complement(1502..1507)
     CDS             complement(1969..2451)
                     /function="hemoglobin production"
                     /gene="ORF C"
                     /codon_start=2451
     mat_peptide     complement(1972..2406)
                     /function="hemoglobin production"
                     /gene="ORF C"
     polyA_signal    complement(1974..1979)
     sig_peptide     complement(2407..2451)
                     /function="hemoglobin production"
     TATA_signal     complement(2524..2529)
     polyA_signal    complement(3934..3939)
     CDS             complement(4000..4485)
                     /function="hemoglobin production"
                     /gene="ORF D"
                     /codon_start=4485
     mat_peptide     complement(4003..4437)
                     /function="hemoglobin production"
                     /gene="ORF D"
     sig_peptide     complement(4438..4485)
                     /function="hemoglobin production"
     TATA_signal     complement(4555..4560)
     polyA_signal    complement(5050..5055)
     CDS             complement(5154..5639)
                     /function="hemoglobin production"
                     /gene="ORF E"
                     /codon_start=5639
     mat_peptide     complement(5157..5591)
                     /function="hemoglobin production"
                     /gene="ORF E"
     sig_peptide     complement(5592..5639)
                     /function="hemoglobin production"
     TATA_signal     complement(5709..5714)
BASE COUNT     2290 a   1160 c   1309 g   2238 t
ORIGIN
        1 gatcagctaa gttttcaatt gaatttgaat ctataaacta atccaggcaa aatgaaattc
       61 tttgttgttc ttgctctctg cattgccgca gcctcagctg ctgtcgtacc attgtctgct
      121 gaccaagctt cacttgtcaa gtcatcatgg aaccaagtca aacacaacga agttgatatt
      181 cttgctgcca tcttcgccgc caacccagac attcaagccc gtttctcaca attcgctgga
      241 aaggatgttg ctggattgaa agacacagct gccttcgcca cacatgccgg aagaatcgtc
      301 ggtttcttct cagaaatcat tggacttact ggaaacgcag ctaatgcccc agccctccaa
      361 acactcgtcg gacaactcgc tgccagccac aaggcccgtg gaatcccaac tgctcaattc
      421 ggtgaattca gaacctcact cgtcgcatac ctccaagcta atgtctcatg gggtgacaat
      481 gttgctgctg cctggaacca agctcttgac aacttattct tcgttcttac ctcaaactac
      541 taaacagatc aataatcttt ttcaatagtg tacagctttg aatgcaatag aataaatttt
      601 cttgattaat ttaattcaaa catttgatat tttgattttt tatttaataa ttaatattca
      661 agtaacttga agagttggca ggactccggg agtccgtgga gtccaaaatt atataaattt
      721 tagaaaattt tccattgtca ttggaaaatg atggaatgta atgaaaattg gtgaaaagtg
      781 atgaaaatga tggaatcaag cctaacttta tatcaaagtt tttaaagcat ttattttaga
      841 aattttggga atgaggcttt tctgaaatat tcgatgtcaa taattatact gaaaacaatc
      901 ctttaatgtg ccgtaagaga atggtacaat tcaggtagtc aagtcatttg tagataatta
      961 ataaatcaaa atttttatta cagcgaattt ccatcaagtg ctaagaaaat aatgtgatac
     1021 atattattca aacctctttc ccatgcctta gcaacgttat cagaattcgt taaattgagc
     1081 ttttgagatt ccacgtcctc tatgactagc acccaattga ttgagagtca ttgcagctgg
     1141 aacacctgat tcatcatttg aaagaccagc catttcggaa aagaatccaa caattcggcc
     1201 ggcgtgtatg gcgaaagcag ctgtgtcctt taaagattca acatcctttc cagcgaattg
     1261 tgggaaatgt gctcgaatgt ctgggttggc ggtgaaaatt gcagcaagta tgtcaacttc
     1321 attgtgccta acctgatccc atgatgatct aataagaatt gcttgatcat cttccaatgg
     1381 aacaatgtca caactgacag caacaatgca caaagttaaa actgatttca aacttaaata
     1441 agctaattat aaataattat gtggtttata aatttttctt agagatcgcg ggtaaaatat
     1501 gtttatatat ttttctgaaa aacatctagt tatcttcttt aatcagtagt cagggataga
     1561 taggaggtgt cactggcctc tgttgactgt cgattacatg taacgttcgg tggcatttgc
     1621 cgctttcatc ctactgtgtg tcaactattc tataaactgt tcgatctaag tttgagtgcg
     1681 atggcttggc gaaccttcat tcgttatact atcatcgacc gtgtaacgct ctatcatagc
     1741 taagcatcac attgatctga acagctaaag tattttttta tccataaaat aacgatagat
     1801 tactaaattc aatcaatatt attctataaa aataaataca taaaaaaatt tatttgcttt
     1861 aagaaaatca ttttttattg caagttatcc gaattatcta catgctttaa acgatgtcaa
     1921 taaatacatg tatattaagg tcctattttt gtttcactag catcaatttt acaatttagc
     1981 gaagatcatt gtgaaggcgt tgtcaagacc ttgtgtccag gcagctccga gtttatcgtt
     2041 ccatgtaacg tggctttgga ggtatgagac gagtgaagca cggaattcgt tgaattgagc
     2101 ctgtggaatt ttacgagctt tgtggttggc agccatttcg ttgataagtg ttaccatggc
     2161 tggtctgttg gcttcatttc ccataagagc aataatttgt gacatgaatc cgacgattct
     2221 tccggcatgt gtggcgaatg cagctgtgtc tttcaatgag tcgagatcct tgccaacgaa
     2281 ttgtgggaac ttagccatga tgtctgggtt ggccttaaag atgtagtaaa ggatatcaac
     2341 ttcactatgt ttggcttgat tccatgatgc acggaccaat gaggcttcat cagctgacaa
     2401 tggggcggca atagcgccaa caatgcagag agcaagaact aagaatttca ttttactgcc
     2461 tggatttgtt aattacttca aattcaattg aaaacttaac tgctttgatg aaatgcacct
     2521 gcttttatac tccaaattgt acacgtgaaa gtttccttct ttttgaatat tatataataa
     2581 ttaatgagag aaagaaatgg ggaaataagc atcattcaca ttaagaagaa tgataagagg
     2641 cactttgttg catcagaaac aacaacaaga aatcgcattt tagacattaa aattctgaag
     2701 taaaaatttt gcaatcattt ccaaatgttt aaaagtcatt atatgcttct ttttgggaat
     2761 atgtcagatg tgtctatgtt ctcaatagtt taagtgtttt gtagtttgaa taaataataa
     2821 ctatttaatt aaatttaact tatcaatgtt ggattaaaaa actataatcc tcctctatct
     2881 ttcaacattt ttatatgttt atgtggaaaa tttaagatag tacagatgca gtcgttagaa
     2941 gtcacatgga atgtctcatt gcattagatc cccttactcc ctcctctaaa tgatggacgt
     3001 cataagcgaa taaaccttaa gtaaacaatt ttaatgtcat tgaccaagga gggattggaa
     3061 aacaccacta aaagtgtgta caaaatcata cggcaaatat agaaaacacg gatatttttc
     3121 gagggaaaat actcgaaaat gtgtataaat tgtagcagaa ttccgacact actttagcgt
     3181 tacaaaataa ccaaatgata ttcatggagt acgatttttt aaacaaacga gaaacaaaaa
     3241 aattgacaaa ttgataaaaa taaggtgtat aatttaataa aatcggtgga gcaaatgggg
     3301 taaaattaca ttgctgatgg tatatcatgg agattttgga cttatattgt aaacttgttt
     3361 acacttagtt tatttttaaa aaaatattat gtttacttat gtatttcaat attaatgtat
     3421 gtataatata ttttaggaga tcaacttaat tttagatttt tataatattc aattgatgat
     3481 ttttgatgcg tgaaaatcca gtttataagt tttgattttg tcaatgaaga caattgattt
     3541 ttcaattctg cacagttgta tgacccaggg tacatcaatg caatgcctag tcacaatttt
     3601 tagaacagcg aaaaataatt ctaatctgac aagatattcc ctgagatcaa accagggtat
     3661 ttaaagtctg ttggctgaca ctttaatcac tacgccactg ctgcactaag ttttttatat
     3721 attaaaaaaa tttaattgag tcgctggtac caaaagctag gggtcccttc cagctttacc
     3781 aaattgacgc acatgcacta attgcaagga atttcgctaa atttctagac cacctaaaat
     3841 gtaacaaaat ttaaagaaaa attcatcgac tagagcaaac atatctaaaa attaaataat
     3901 aaagcgttca tgttaatttt taatacgttg acttttatta aaaatgtttc ataacatgag
     3961 gtgttgtttc atgtatttgt acttcaatca atgcttgatt tacaagtggg cgaagatagc
     4021 gtcgaagatg ttgtcaagac catgagtcca ggcagcggct gtggcatcat tccatgtagt
     4081 atggtgtgaa aggtatgagg tcattgaggc acggaattcg ttgaattgag cctttgagat
     4141 tccacggttg tgatggttag ttgccaattc gttggtaagg gtgttcatgg ctgggcggtt
     4201 agcttcgttt ccaacaagag caacaatttc tgagatgaat ccgacgattc ttccggcgtg
     4261 tgtggcgaaa gcagctgtgt ccttcaatga gtcgacgtcc tttccggcga attgtgggaa
     4321 acgggcttgg atatctgggt gatctttgaa gacggcagca aggatgtcaa cttcattgtg
     4381 cttaacttga gcccatgagc tacggatgag atcagcttgg tcagctgaaa ttggatcggc
     4441 gacggcaccg acgatgcaga gagcgagaac agcgaggaat ttcatcttgt cttgatttgt
     4501 ttgttgcttc aagttgaatt gaaaacttag tttgcttaga caaattttgc tccttttata
     4561 ctaaaattac gcgtgagaat tctaaaagag attagacatg ttacgcaaaa ataaaaagtt
     4621 tgaacatttt aaacctgaca aaattgtaaa catgataagg atggttacgt tgtgaaacga
     4681 tgacgaaatt attagttatg tgtttaatcg ctagcttatg tcagagtttt catttacata
     4741 ataagattcc gcaatatttc gggagatttc gtttagttac ttatttcgga attttcagtc
     4801 attgaatcaa tcattaaaaa atgctagaca gatttgattt caacaaacat tttactgact
     4861 gcaaacattg tcaatcatat gctgaaacat tgttttgacg ctcatgaatt tttttgattt
     4921 attttctttt taacaaattt tcctgttcga gtgaagtatt ttatgcattc tgtcctccag
     4981 ctagctgaac attgtgatcg attgaagcta tcttaatata ataaacaata aatttttgat
     5041 tcgtagattt ttattaaaaa tcatccaaat catggacttt tactatgcac atgctttgag
     5101 ctgttaataa atacatgttc ttattgattc tataaaatca caatttacaa aatttacaaa
     5161 gcggcgaaaa ggagtccgta gatgttatca agtccttgtg tccaggcagc agcaacattg
     5221 tcgccccatg caacattagc ttggaggtat gagacgagtg aggcacggaa ttcattgaat
     5281 tgagcttgtg agattccacg ggccttgtgg ctggcagcga gttgtccgac gagtgtttgg
     5341 acagctgggg catttgattc gtttccgacg agagcaatga tttctgagac aaatccgacg
     5401 attcttccgg cgtgtgtggc gaaagcagct gtgtccttca atgcagcaac atcctttcca
     5461 gcgaattgtg ggaaacgggc ttggatgtct gggttggctg tgaagacggc ggcgaggatg
     5521 tcaacttcgt tatttctaac ttgagcccat gttgattgaa caagtgaggc ttggtcagca
     5581 gtcaatggag tggcgatggc accgacgatg cagagggcga gaacagcgaa gaatttcatt
     5641 ttgtcaggat tagtttatag attcaaattc aatcgaaaac ttaactcaat ttaaaaattc
     5701 ctaaaccttt tatacaaaaa atccgaatga gcattgatcg taagtgagaa ttaaaattgt
     5761 tgaaataatt tagaaaaaag actttgtcat aatatagcat tgaccgcaat tttgataaga
     5821 aattcatctg atctagacta aacaaaattt tacgatttaa cagcatgtac gtgaggtagt
     5881 tttcatttct ataatctaat ctgtgtgacg aggttccaga cgatttcttc ttattcttaa
     5941 taattctcaa attaattgat ataaattcta aaaagaccat acgcaacacg tagcgacatt
     6001 gaagtatgaa tttttcattt ctttaaaagt gggaattgaa caaggaagag ctggaaaggc
     6061 tagtaattat tatacaataa aaaaaaattt taatctaaat ttcacacaaa aattaaaaaa
     6121 agaaaatgaa aatgaatttt aagacctttt tcaaacaaat ataacaaatt tggtcattgt
     6181 tggccttttt gaaaaatttt atttcacacg gtttgaaaca cactgtttta ggtagtgtta
     6241 tcgcaaagga cgatggtgct gacaacgaag gatatgatag gactcgactg gccagtgcaa
     6301 atttcagtcc cttttttcat atgatcattt cgtaactaaa atttttccaa tcatcaaata
     6361 gcaactctga cttgcaggtg atgaaaggtt aataaatcat cgcttgcgag atcgacagaa
     6421 agaaatttga ctggatcagg tacactttaa ggaagcatga aacagatcta accaaatagg
     6481 aactgttttg attgtctcga gctcgagaaa aaaaaacaga attgtaacaa atgaagctag
     6541 taaaagcaac tgtccatata agaaactgtt caacagctct tttagtgccc taaaattcac
     6601 agtggataaa tagaaaatca ttatttttac ataattttta tttgatgtga aatagaataa
     6661 attgttctca ataaagaatc tgacagagtc gaggctgctc agcattgcaa aaaagccaca
     6721 tacttgaata aaatttgttc gtcaaaaaag aaaaaataat aatgaggaaa tgaagaaaat
     6781 aattttcatt attcaataat ttttttttaa attactctaa aaagactttg aagttattca
     6841 taaatgacat tcaccgttta tgggagaaaa gttctagaat tttgtgacag ataatgacat
     6901 agggggatgc ggatcaaaaa atcaattttc aatgaacttc atttataaac aaccccttta
     6961 gaaaaatctg aaacatttag aggactttta aaaagct
//