GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS       RCACRTAK    10068 bp ds-DNA             BCT       27-MAY-1991
DEFINITION  Rhodobacter capsulatus clustered genes crtA-F, I and K involved in
            carotenoid biosynthesis
ACCESSION   X52291
KEYWORDS    crtA gene; crtB gene; crtC gene; crtD gene; crtE gene; crtF gene;
            crtI gene; crtK gene; crtK protein;
            hydroxyneurosporene methyltransferase;
            hydroxyneurosporene synthase; methoxyneurosporene dehydrogenase;
            photosynthesis; phytoene dehydrogenase; phytoene synthase;
            prephytoene pyrophosphate synthase; spheroidene monooxygenase.
SOURCE      Rhodobacter capsulatus DNA.
  ORGANISM  Rhodobacter capsulatus
            Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple
            nonsulfur bacteria.
REFERENCE   1  (bases 1 to 10068)
  AUTHORS   Armstrong,G.A., Alberti,M., Leach,F. and Hearst,J.E.
  TITLE     Nucleotide sequence, organisation and nature of the protein
            products of the carotenoid biosynthesis gene cluster of
            Rhodobacter capsulatus
  JOURNAL   Mol. Gen. Genet. 216, 254-268 (1989)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 10068)
  AUTHORS   Alberti,M.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P17054; CRTI$RHOCA.
            SWISS-PROT; P17055; CRTA$RHOCA.
            SWISS-PROT; P17056; CRTB$RHOCA.
            SWISS-PROT; P17057; CRTK$RHOCA.
            SWISS-PROT; P17058; CRTC$RHOCA.
            SWISS-PROT; P17059; CRTD$RHOCA.
            SWISS-PROT; P17060; CRTE$RHOCA.
            SWISS-PROT; P17061; CRTF$RHOCA.
            Rhodobacter capsulatus is a pseudonym for Rhodopseudomonas capsulata
            Start codon for crtI [ref. 2] and crtF is GTG.
            *source: strain=SB1003; clone=pRPS404, subclones: pFL227, pFL268
            *source: pFL104, pFL103, pGABX1, pGABX2
            Gene: Product
            CrtA: spheroidene monooxygenase
            CrtI: phytoene dehydrogenase
            CrtB: prephytoene pyrophosphate synthase
            CrtK: crtK protein
            CrtC: hydroxyneurosporene synthase
            CrtD: methoxyneurosporene dehydrogenase
            CrtE: phytoene synthase
            CrtF: hydroxyneurosporene methyltransferase
            From EMBL entry RCCRTAK; dated 31-MAY-1990.
FEATURES             Location/Qualifiers
     CDS             complement(5..1777)
                     /note="crtA gene product (AA 1-591)"
                     /codon_start=1777
     misc_feature    complement(1794..1818)
                     /note="palindromic motif (pot. binding site for regulatory protein)"
     misc_feature    1871..1895
                     /note="palindromic motif (pot. binding site for regulatory protein)"
     promoter        1873..1900
                     /note="E. coli-like sigma-70 promoter"
     CDS             1935..3506
                     /note="crtI gene product (AA 1-524)"
                     /codon_start=1935
     conflict        2034..2036
                     /note="crtI GTG start in ref [2]"
                     /citation=[1]
     CDS             3506..4522
                     /note="crtB gene product (AA 1-339)"
                     /codon_start=3506
     terminator      3589..3623
                     /note="pot. rho-independent terminator"
     terminator      4505..4538
                     /note="pot. rho-independent terminator"
     CDS             4601..5080
                     /note="crtK gene product (AA 1-160)"
                     /codon_start=4601
     terminator      5095..5135
                     /note="pot. rho-independent terminator"
     terminator      complement(5157..5225)
                     /note="pot. rho-independent terminator"
     CDS             complement(5321..6163)
                     /note="crtC gene product (AA 1-281)"
                     /codon_start=6163
     CDS             complement(6232..7713)
                     /note="crtD gene product (AA 1-494)"
                     /codon_start=7713
     misc_feature    7778..7802
                     /note="palindromic motif (pot. binding site for regulatory protein)"
     promoter        complement(7791..7818)
                     /note="E. coli-like sigma-70 promoter"
     CDS             7850..8716
                     /note="crtE gene product (AA 1-289)"
                     /codon_start=7850
     CDS             8722..9900
                     /note="crtF gene product (AA 1-393)"
                     /codon_start=8722
     terminator      9885..9928
                     /note="pot. rho-independent terminator"
     promoter        9983..10011
                     /note="E. coli-like sigma-70 promoter"
     misc_feature    9994..10018
                     /note="palindromic motif (pot.
binding site for regulatory protein)" CDS 10066..>10068 /note="bchC gene product (ORFJ)" /codon_start=10066 BASE COUNT 1676 a 3292 c 3350 g 1750 t ORIGIN 1 gttacggcaa cgtttcttcg accgtccggg ccacgcgggc cgtcgagccg gcttcatcaa 61 ggggatcgcg gcgcagacgg tgcgacagcg ccatcgttgc cacccgcttc aggtgatcgc 121 gaccgaccgc ggttgcccct tccagcgccg caagggcacg ggccgatcgc agaagcgtca 181 gctcgccccg cagaccgtcg gaccccagcg cgatgcaaag cgcggcgcag tcataaagcg 241 cggtgttcgg cgcctcgacc ttcggcagac gttcacgcgc ttccagaatc tggttgcgga 301 tgtccatgtc cttgggccgc cattcttcca gaaaggcttt cggatctgcg tcataggtgt 361 cgcggcgacg gatcacctcg acccgggttt cgacgtcgcg gggcgacaga acctcgaccg 421 agaggccgaa acggtccaga agctgcggcc gcaggtcgcc ctcttccggg ttgcccgagc 481 cgaccagaac gaaacgggcg gggtgacgga tcgagaggcc gtcgcgttcc accacgttct 541 cgcccgattg cgccacgtcg agcagaagat cgacgatgtg atcttccagc aggttgcatt 601 cgtcgatgta aaggtaaccg cggttggccc gggccagaag gcccggttca aacgcctttt 661 cccctttcga gatcgcgcgc tcgatatcca gcgcccccac cacgcggtct tccgacacgc 721 cgagcggcag atcgacgacc ggggtcggct tgcggatcac gttggtcgaa agcacggtcg 781 cccaatcggg gatcatctcg acattgggcg aggacaccgg gcagccctcg accgcctcga 841 tttcgggcaa aagcgccgcc agcgcccgca ccgcggtcga tttgccggtg ccgcggtcac 901 caaagaccag aacgccgccg atgccgggat cgaccgcggt caacagcagc gccagtttca 961 tgtcttcctg accgacgatg gcagaaaagg gaaagacggg tctcgtcttc gcgcccgaag 1021 cagagggttg aagtcgagcg acggcggtag tcatgcagtt tccctttcca agactttgct 1081 ggcgagcggg tccttgccca tccagcttcc ctcggtcccc aagcaaacgg aaccgggcat 1141 agagttcttc ggtgaaccaa ccttcttcac gcgcagcacg gatggccttg ccatgcggcg 1201 tgtcgccgcg ggcgaaggcg ttcatctttg ccacatccgg ccagatcgag aaagtcactt 1261 gatggaaaag gggaatttcc ccgatcccga tcttgaacat caggttctga tcctcgccga 1321 ctttctcgga aatcttcggc acgcgcgacc agaaggcatt ggccttgtgc ggcttgatcg 1381 cggcgcgggt cagcgccacc acgggctcgt ccgggcttgg ctcggccacc tgttcgggca 1441 ggaacggatt gacgccgccc caggtgccgc gcgccgaaag cggctgcaga tgcagcacca 1501 gcgtttccgc cgcatgcgcc cgccagcgct tccagaccgg gtggttcgcg gtcacgtcgc 1561 gggcatcggc 
ctcggtgtcg aaagccgcca tgatcgccca gacgcgccag ttcggcttcg 1621 gcgtgaagcc ctcgccggtg cccgatccgc acagtttgta gaacttcacc cgcggctcgt 1681 cgttcagggg gcggcgggac aggatcatct ggctgatgac ccagggcagc gagcttgtgc 1741 cgtcgaaacg gaacaggctg agtgaggcga caggcatcta cgtcctcccc tgtgatgtgt 1801 aacgggatat ttacatctgg gtcccttgta atggaaggcc tcagacgttt tgtcgcgaga 1861 cagcgtcgac agttgtaaat cggaattgac gacctatcat ccccccaatg caacctgaaa 1921 ctaccgaaga aaccatgtcc aagaacacag aaggtatggg tcgcgccgtt gtcatcggtg 1981 ccggccttgg cggtcttgct gcagcgatgc ggctgggcgc aaaaggttac aaggtgacgg 2041 tcgtcgatcg tctggatcgg cccggcgggc gtggctcttc gatcaccaag ggcgggcatc 2101 gattcgacct tgggccgacg atcgtgacgg tgcccgaccg gctgcgcgag ctttgggccg 2161 attgcgggcg cgatttcgac aaggacgtga gccttgtgcc gatggagccc ttctacacca 2221 tcgatttccc cgatggcgag aaatacaccg cttacggcga tgacgccaag gtcaaggccg 2281 aggtggcgcg gatcagcccc ggcgatgtcg agggcttccg ccatttcatg tgggacgcca 2341 aggcccgtta tgaattcggc tatgaaaacc tcggccgcaa gccgatgagc aagctgtggg 2401 acctgatcaa ggttctgccg actttcggct ggctgcgcgc cgaccgctcg gtctatggcc 2461 atgccaagaa gatggtgaag gacgaccacc tgcgcttcgc gctgtcgttc catccgcttt 2521 tcatcggcgg cgaccccttc catgtgacgt cgatgtatat cctcgtcagc cagctcgaaa 2581 agaaattcgg cgtgcattac gcgatcggcg gcgtgcaggc gattgccgat gcgatggcca 2641 aggtgatcac cgatcagggc ggcgagatgc gcctgaacac cgaggtcgac gagatcctgg 2701 tctcgcgtga cggcaaggcc acgggcatcc ggctgatgga cggcaccgag cttccggcgc 2761 aggttgtcgt ctcgaacgcc gatgcgggcc acacctacaa gcgtctcttg cgcaaccgcg 2821 accgctggcg ctggaccgac gagaagctcg acaagaagcg ctggtcgatg gggcttttcg 2881 tctggtattt cggcaccaag ggtacggcca agatgtggaa ggatgtgggt caccacaccg 2941 tcgtcgtcgg gccgcgctac aaggaacatg tgcaggacat cttcatcaag ggcgagctgg 3001 ccgaggacat gagcctttat gtccaccgtc cctcggtcac tgatccgacc gcggcgccga 3061 aaggcgacga caccttctac gtgctttcgc cggtgccgaa cctcggcttc gacaatggcg 3121 tggactggtc ggtcgaggcc gagaaataca aggccaaggt gctgaaagtg atcgaggaac 3181 ggctgcttcc gggggttgcc gaaaagatca ccgaggaagt ggtcttcacg ccggaaacct 3241 tccgcgaccg ttatctctcg 
ccgctgggcg cgggcttctc gctggaaccg cggatcctgc 3301 aatcggcctg gttccgcccg cataacgcct cggaagaggt ggacgggctt tatctggtcg 3361 gcgcgggcac ccatccgggc gccggtgtgc cctcggtgat cggctcgggc gagcttgtcg 3421 cgcagatgat cccggatgcg ccgaagcccg agacccccgc ggcggctgcg cccaaggccc 3481 ggacgccccg ggccaaggcg gcgcaatgat cgccgaagcg gatatggagg tctgccggga 3541 gctgatccgc accggcagct actccttcca tgcggcgtcc agagttctgc cggcgcgggt 3601 ccgtgacccc gcgctggcgc tttacgcctt ttgccgcgtc gccgatgacg aagtcgacga 3661 ggttggcgcg ccgcgcgaca aggctgcggc ggttttgaaa cttggcgacc ggctggagga 3721 catctatgcc ggtcgtccgc gcaatgcgcc ctcggatcgg gctttcgcgg cggtggtcga 3781 ggaattcgag atgccgcgcg aattgcccga ggcgctgctg gagggcttcg cctgggatgc 3841 cgaggggcgg tggtatcaca cgctttcgga cgtgcaggcc tattcggcgc gggtggcggc 3901 cgccgtcggc gcgatgatgt gcgtgctgat gcgggtgcgc aaccccgatg cgctggcgcg 3961 ggcctgcgat ctcggtcttg ccatgcagat gtcgaacatc gcccgcgacg tgggcgagga 4021 tgcccgggcg gggcggcttt tcctgccgac cgactggatg gtcgaggagg ggatcgatcc 4081 gcaggcgttc ctggccgatc cgcagcccac caagggcatc cgccgggtca ccgagcggtt 4141 gctgaaccgc gccgaccggc tttactggcg ggcggcgacg ggggtgcggc ttttgccctt 4201 tgactgccga ccggggatca tggccgcggg caagatctat gccgcgatcg gggccgaggt 4261 ggcgaaggcg aaatacgaca acatcacccg gcgtgcccac acgaccaagg gccgcaagct 4321 gtggctggtg gcgaattccg cgatgtcggc gacggcgacc tcgatgctgc cgctctcgcc 4381 gcgggtgcat gccaagcccg agcccgaagt ggcgcatctg gtcgatgccg ccgcgcatcg 4441 caacctgcat cccgaacggt ccgaggtgct gatctcggcg ctgatggcgc tgaaggcgcg 4501 cgaccgcggc ctggcgatgg attgaggccg cgacggttcg tcactggacg tgcgcgcgcc 4561 ccgggtccac aacggggcgt gtccacaacc ggaggccatg atgagcctga ctctctttgc 4621 cgtctatttc gtcgcctgcg cctgcgcggg cgcgaccgga gcgatcttca gccccggcgc 4681 atggtatgac agcctgaaga aaccgagctg ggtgccgccg aactggctgt ttcccgtcgc 4741 ctggtccacg ctttacatcc tgatgtcgat ttcggcggcg cgggtatccg ggctggcgat 4801 ggaaaacgaa ctggccgtgc tgggtctggc cttctgggcg gtgcagatcg cggtcaacac 4861 gctttggacg ccgatcttct tcggcctgca ccggctggcg ggcgggatgc tggttctggt 4921 gcttttgtgg ctgagcgtct ttgccacctg 
cgtcctgttc tggagcgtcg actggctctc 4981 ggggctgatg ttcgtgccct atgtgatctg ggtgacggtg gccggggcgc tgaatttcag 5041 cgtctggcgg ctcaatccgg gcgaaaagcc gatcacgctt tgatctgctg atcctggtgt 5101 gaccgaggcc atcgcctcgg tcatgcgcaa cctttccccg gggccgccga gggcccaaag 5161 gcactcggcc agcgcccgcc ccgcccatgg cgggcgcttc tcgcccgccg ggacgggcgc 5221 tggcctctgt tgtgcttggg gaggcggtct tgcggatgcg tcgcgcggcg ggcggggaaa 5281 agcggtgctt ttcctgatcc gcccgaaacg gacgatttca gtccttgtgc ttccacccgg 5341 cccggcgcgg cacccgcacg gccagcatcg gcttcagcac cggctggcgg aaccgcacca 5401 gatcgagcgc ctcgtgcatg cccaccgttt cctcgccgtc gatctgcgtg cgcatcagcg 5461 cgcgggaata gaagggcgcc tccagcagcg aatagacccc ggccggatgc gcccccgcat 5521 cgcagcgcgt ttcacgccgc acgaaccagg gcgtgcgctt catcttcacc aggggcggag 5581 ggctgtcgat ctcgcagacc gacccgtccg ggttgatctg cacggcgagc gccaccttgg 5641 tccggtccaa ccgcgtcgca tcgtaaaaac agaccgttcg atccttcagc gggaaccgcc 5701 cccaggtcca gaacgagaag tcttcctcca gcgcgcgggt gccgaaattg gcgtcgaaat 5761 agccgtgtcc ggtccatttg tgcccgggcg ccagatcgac ctcgacatcg gcgatgggcg 5821 cgaaaggccg ccaggtgtgg cccgcatccg gcgtcagccg cacctcgacc ccggtcaccg 5881 cgcgcggcgt cagcacgacc cggcccttga gcttgcccag cttcggcaaa gccccccatt 5941 cgtcaacgtc gatcaccagc tccttgccgg tccaggtcag cttcgacggc ccgacctgaa 6001 agctgtcgcg cgactgccgc agcgcagacc ggccccggtc ggtcatcgtg aagcgcccat 6061 cagtgccggt cgtcaccatg ttgatgcagc agtgattttg cggctcgcga cgacccgacc 6121 agcgatacca cgggctgaag accgagccga tgaaggcgat catcgagaag gccttttcgc 6181 cgtcgtcgga gatgccgtcg caataccacc aggcgtagcc gttcggccct acttcgcggc 6241 ggaaatccgg tccgccagca aggcccgtgc tgcatgcgtc ccggaagtca gcgccatggg 6301 caccccggca cccgggtgcg ttcccccacc cgccagataa agccccttga gccccgtccg 6361 cgccagaggc cgccggaagg tcgccagcgt cccctccggc gagccgccgt agatcgcccc 6421 cagcgagccc gggaaccgcc gcgacagcag cgccggcgtc gtcagcgccc gggtttccgg 6481 gtccgggctg aaggtcagtc ccatcgcggc gagcatcggg aaggtgcgtg ccctgcattg 6541 tgcctcctcc tgaggaaagg gttgatggcc tgccgggccg ttcatgatga tttcaaagcg 6601 ttcgatttcc ggcaccgggg cctgcatctc gcggtcctgc 
gcacagatgt aaagcgtcgg 6661 ttcctcgggc atctcgcccg cgccgatggg cccgaattcc agctccggat cggcggtgaa 6721 gaagacgttg tgatgcgcga ggtcaacgcc gatcggggtg gcgccaaagg cccagaccca 6781 ggcggaaaga ctgggggcgg ggcggggcga tttctccatc gaggcgcggg cggcatcgcc 6841 cagaagcccg tcgcgcaacg cccccggatc gccgttgaag atgcaggcgc cgcaggggat 6901 cgagacgccg gtctcgatct cgaccgcggt gacccggcct tccttgcgca cgatgcgctt 6961 ggccttggcg ccgtaatgga accgcacgcc cttcgcctcg gccacccggg ccagcgcggc 7021 ggcaacgcca tgcatcccct cgcggatcgc ccagacgccc tgaacctcgg cctgccagat 7081 cagcgagagc accgcgggcg tcgcccccgg acggccgccg acataggtcg catagcggcc 7141 gaagagctgc gccagccgcg gatccttgaa gtgatgggcg agcaggtcgc gcatcgtcag 7201 cccggggcgc agcgcgggcc aaagctgcgg ccgcgtcacc gtggccgcgg cgattcgcca 7261 aagatccggc ttcggcgccg cgatcaccga gcggtggaac gcctcccaca gccccgtcgt 7321 caggtgatcg aagcggcgga atgccgcagc ttccttgtcg cccgcaaagg cccggatcgc 7381 ctcgatattg gcttcggtgt cggtgaacag atcgaggctg gagccgtcgg gccagaaatg 7441 ccgggcgagt cggggcaggg ggatcagcgt cagatgttcc tcggcccggg tgccgcaggc 7501 ggcaaagagc gcgtccagaa catgccgcat cgtcaagacc gtgggccccg tatccgcagg 7561 cccgccgggg gtcggaaccg cccgcgcctt gccgcccggg gcatcgcccg cctcaaccac 7621 cgtgacccga aggcccgccg ccgccgcacc gatggcggcc gcaaggcccc ccattctggc 7681 gccgatcacc acgacgtccg tttcactccg catcgctcgc tcccgcacgc accgccgtcc 7741 gcaccccccc gggagtgcaa ttttgtccca tattcttggg tgtaagtttc agtttacaca 7801 ggtaggtgcg aatgccaatg tgcgtcgtga cgcagcggag ggctctgtca tgtctctgga 7861 taaacgtatc gagtcggcgc tggtcaaggc gctgtcaccc gaggctttgg gtgaatctcc 7921 gccgttgctt gccgccgcgc tgccttacgg ggtgtttccc ggcggcgcgc ggatccggcc 7981 gacgatcctt gtctcggtcg cgctcgcctg tggcgacgat tgcccggcgg tcaccgatgc 8041 cgcggccgtg gcgctggagc tgatgcattg cgcgagcctc gtgcatgacg atctgcccgc 8101 cttcgacaat gccgacatcc ggcgcggcaa gccgagcctt cacaaggcct ataatgaacc 8161 gcttgcggtt ctggcgggcg acagcctgct gatccgcggc ttcgaagtgc tggccgatgt 8221 cggcgccgtc aacccggacc gggcgctgaa gctgatctcg aaactgggtc agctgtcggg 8281 ggcgcgcggc gggatctgcg ccggtcaggc ctgggaaagc gaatccaagg 
tcgatctggc 8341 cgcctatcat caggcgaaga ccggggcgct gttcattgcc gcgacccaga tgggggcgat 8401 tgcggcgggc tacgaggccg aaccctggtt cgatctgggc atgcggatcg gctcggcctt 8461 ccagatcgcc gacgacctga aagacgcgct gatgtcggcc gaggcaatgg gcaagcccgc 8521 cgggcaggac atcgcgaacg aacgcccgaa tgcggtcaag acgatgggca tcgagggcgc 8581 gcgcaaacat ctgcaagatg tgctggcggg ggcgatcgcc tcgatcccgt cctgccccgg 8641 tgaggcgaag ctggcccaga tggtgcagct ttacgcccac aagatcatgg acatcccggc 8701 cagcgccgag aggggctgat cgtgccgaag gacgaccaca cgggcgcgac ggccgaccgg 8761 accgcgcagc cgacaggaac gggaaagcag ccgctggttc cgggccagcc cggggcggcg 8821 ccggtgcagc cggggcgggt gaatttcttc acccggatcg cgctgtcgca acggctgcat 8881 gaaatcttcg aacgcctgcc gctgatgaac cgcgtcaccc ggcgcgaggg cgaggcgctc 8941 ttcgacatcg tttcgggctt cgtgcaaagc caggttctct tggcgatcgt cgaattccgg 9001 gtgctgcata ttctggccgg ggcctcttgg cccttgccgc aactggccga acgcaccggc 9061 ctggccgagg accggctggc ggtgctgatg caggccgccg ccgccttgaa gctggtgaaa 9121 ttccgccgcg gtctgtggca gcttgccccg cgtggcgccg ccttcatcac cgtgccaggg 9181 ctcgaggcga tggtgcgcca tcaccccgtc ctttaccgcg atctggccga tccggtggct 9241 tttctgaaag gcgacatcga acccgagctg gcgggcttct ggccctatgt cttcgggccg 9301 ctggcgcagg aagatgcggg gctcgccgag cgctattcgc agctgatggc cgacagccag 9361 cgcgtcgtgg ccgatgacac cttgcggctt gtcgatctgc gcgatgccaa gcgggtgatg 9421 gatgtgggcg gcggcaccgg ggccttcctg cgcgtcgtgg ccaagcttta ccccgagctg 9481 cccttgacgc tgttcgacct gccgcatgtg ctgtcggtgg cggaccgctt cagcccgaag 9541 ctcgatttcg cgccgggcag cttccgcgac gatccgatcc cgcagggcgc cgatgtcatc 9601 actttggtgc gcgtgctgta tgaccatcct gacagcgtcg tcgaaccgct tctggccaag 9661 gtgcatgccg ccttgccgcc gggcgggcgt ctgatcatct cggaggcgat ggcggggggc 9721 gcaaaacccg accgtgcctg cgatgtctat ttcgccttct acacgatggc gatgagttcg 9781 gggcgcacgc gttcccccga agagatcaag caaatgcttg aaaaagctgg gttcaccaag 9841 gtgtcgaaac cgcggaccct gcgccccttc atcacctcgg tgatcgaggc cgaacgcggc 9901 tgacacggct gcgttcggac ccggctttga cccgggggtc agaaagtcgc acatccgtct 9961 gtcgcaaaag tgtctaatca aattgacagt cgggcgtgta agttcaatga tacacacagg 
10021 cgtgatcagc ccgactctcc ggcccgatca taccgggagc aagaaatg //