GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS RCACRTAK 10068 bp ds-DNA BCT 27-MAY-1991
DEFINITION Rhodobacter capsulatus clustered genes crtA-F, I and K involved in
carotenoid biosynthesis
ACCESSION X52291
KEYWORDS crtA gene; crtB gene; crtC gene; crtD gene; crtE gene; crtF gene;
crtI gene; crtK gene; crtK protein;
hydroxyneurosporene methyltransferase;
hydroxyneurosporene synthase; methoxyneurosporene dehydrogenase;
photosynthesis; phytoene dehydrogenase; phytoene synthase;
prephytoene pyrophosphate synthase; spheroidene monooxygenase.
SOURCE Rhodobacter capsulatus DNA.
ORGANISM Rhodobacter capsulatus
Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria;
Purple nonsulfur bacteria.
REFERENCE 1 (bases 1 to 10068)
AUTHORS Armstrong,G.A., Alberti,M., Leach,F. and Hearst,J.E.
TITLE Nucleotide sequence, organisation and nature of the protein
products of the carotenoid biosynthesis gene cluster of Rhodobacter
capsulatus
JOURNAL Mol. Gen. Genet. 216, 254-268 (1989)
STANDARD full automatic
REFERENCE 2 (bases 1 to 10068)
AUTHORS Alberti,M.
JOURNAL Unpublished (1990)
STANDARD full automatic
COMMENT SWISS-PROT; P17054; CRTI$RHOCA. SWISS-PROT; P17055; CRTA$RHOCA.
SWISS-PROT; P17056; CRTB$RHOCA. SWISS-PROT; P17057; CRTK$RHOCA.
SWISS-PROT; P17058; CRTC$RHOCA. SWISS-PROT; P17059; CRTD$RHOCA.
SWISS-PROT; P17060; CRTE$RHOCA. SWISS-PROT; P17061; CRTF$RHOCA.
Rhodobacter capsulatus is a pseudonym for Rhodopseudomonas
capsulata Start codon for crtI [ref. 2] and crtF is GTG. *source:
strain=SB1003; clone=pRPS404, subclones: pFL227, pFL268 *source:
pFL104, pFL103, pGABX1, pGABX2 Gene: Product CrtA: spheroidene
monooxygenase CrtI: phytoene dehydrogenase CrtB: prephytoene
pyrophosphate synthase CrtK: crtK protein CrtC:
hydroxyneurosporene synthase CrtD: methoxyneurosporene
dehydrogenase CrtE: phytoene synthase CrtF: hydroxyneurosporene
methyltransferase
From EMBL entry RCCRTAK; dated 31-MAY-1990.
FEATURES Location/Qualifiers
CDS complement(5..1777)
/note="crtA gene product (AA 1-591)"
/codon_start=1777
misc_feature complement(1794..1818)
/note="palindromic motif (pot. binding site for regulatory
protein)"
misc_feature 1871..1895
/note="palindromic motif (pot. binding site for regulatory
protein)"
promoter 1873..1900
/note="E. coli-like sigma-70 promoter"
CDS 1935..3506
/note="crtI gene product (AA 1-524)"
/codon_start=1935
conflict 2034..2036
/note="crtI GTG start in ref [2]"
/citation=[1]
CDS 3506..4522
/note="crtB gene product (AA 1-339)"
/codon_start=3506
terminator 3589..3623
/note="pot. rho-independent terminator"
terminator 4505..4538
/note="pot. rho-independent terminator"
CDS 4601..5080
/note="crtK gene product (AA 1-160)"
/codon_start=4601
terminator 5095..5135
/note="pot. rho-independent terminator"
terminator complement(5157..5225)
/note="pot. rho-independent terminator"
CDS complement(5321..6163)
/note="crtC gene product (AA 1-281)"
/codon_start=6163
CDS complement(6232..7713)
/note="crtD gene product (AA 1-494)"
/codon_start=7713
misc_feature 7778..7802
/note="palindromic motif (pot. binding site for regulatory
protein)"
promoter complement(7791..7818)
/note="E. coli-like sigma-70 promoter"
CDS 7850..8716
/note="crtE gene product (AA 1-289)"
/codon_start=7850
CDS 8722..9900
/note="crtF gene product (AA 1-393)"
/codon_start=8722
terminator 9885..9928
/note="pot. rho-independent terminator"
promoter 9983..10011
/note="E. coli-like sigma-70 promoter"
misc_feature 9994..10018
/note="palindromic motif (pot. binding site for regulatory
protein)"
CDS 10066..>10068
/note="bchC gene product (ORFJ)"
/codon_start=10066
BASE COUNT 1676 a 3292 c 3350 g 1750 t
ORIGIN
1 gttacggcaa cgtttcttcg accgtccggg ccacgcgggc cgtcgagccg gcttcatcaa
61 ggggatcgcg gcgcagacgg tgcgacagcg ccatcgttgc cacccgcttc aggtgatcgc
121 gaccgaccgc ggttgcccct tccagcgccg caagggcacg ggccgatcgc agaagcgtca
181 gctcgccccg cagaccgtcg gaccccagcg cgatgcaaag cgcggcgcag tcataaagcg
241 cggtgttcgg cgcctcgacc ttcggcagac gttcacgcgc ttccagaatc tggttgcgga
301 tgtccatgtc cttgggccgc cattcttcca gaaaggcttt cggatctgcg tcataggtgt
361 cgcggcgacg gatcacctcg acccgggttt cgacgtcgcg gggcgacaga acctcgaccg
421 agaggccgaa acggtccaga agctgcggcc gcaggtcgcc ctcttccggg ttgcccgagc
481 cgaccagaac gaaacgggcg gggtgacgga tcgagaggcc gtcgcgttcc accacgttct
541 cgcccgattg cgccacgtcg agcagaagat cgacgatgtg atcttccagc aggttgcatt
601 cgtcgatgta aaggtaaccg cggttggccc gggccagaag gcccggttca aacgcctttt
661 cccctttcga gatcgcgcgc tcgatatcca gcgcccccac cacgcggtct tccgacacgc
721 cgagcggcag atcgacgacc ggggtcggct tgcggatcac gttggtcgaa agcacggtcg
781 cccaatcggg gatcatctcg acattgggcg aggacaccgg gcagccctcg accgcctcga
841 tttcgggcaa aagcgccgcc agcgcccgca ccgcggtcga tttgccggtg ccgcggtcac
901 caaagaccag aacgccgccg atgccgggat cgaccgcggt caacagcagc gccagtttca
961 tgtcttcctg accgacgatg gcagaaaagg gaaagacggg tctcgtcttc gcgcccgaag
1021 cagagggttg aagtcgagcg acggcggtag tcatgcagtt tccctttcca agactttgct
1081 ggcgagcggg tccttgccca tccagcttcc ctcggtcccc aagcaaacgg aaccgggcat
1141 agagttcttc ggtgaaccaa ccttcttcac gcgcagcacg gatggccttg ccatgcggcg
1201 tgtcgccgcg ggcgaaggcg ttcatctttg ccacatccgg ccagatcgag aaagtcactt
1261 gatggaaaag gggaatttcc ccgatcccga tcttgaacat caggttctga tcctcgccga
1321 ctttctcgga aatcttcggc acgcgcgacc agaaggcatt ggccttgtgc ggcttgatcg
1381 cggcgcgggt cagcgccacc acgggctcgt ccgggcttgg ctcggccacc tgttcgggca
1441 ggaacggatt gacgccgccc caggtgccgc gcgccgaaag cggctgcaga tgcagcacca
1501 gcgtttccgc cgcatgcgcc cgccagcgct tccagaccgg gtggttcgcg gtcacgtcgc
1561 gggcatcggc ctcggtgtcg aaagccgcca tgatcgccca gacgcgccag ttcggcttcg
1621 gcgtgaagcc ctcgccggtg cccgatccgc acagtttgta gaacttcacc cgcggctcgt
1681 cgttcagggg gcggcgggac aggatcatct ggctgatgac ccagggcagc gagcttgtgc
1741 cgtcgaaacg gaacaggctg agtgaggcga caggcatcta cgtcctcccc tgtgatgtgt
1801 aacgggatat ttacatctgg gtcccttgta atggaaggcc tcagacgttt tgtcgcgaga
1861 cagcgtcgac agttgtaaat cggaattgac gacctatcat ccccccaatg caacctgaaa
1921 ctaccgaaga aaccatgtcc aagaacacag aaggtatggg tcgcgccgtt gtcatcggtg
1981 ccggccttgg cggtcttgct gcagcgatgc ggctgggcgc aaaaggttac aaggtgacgg
2041 tcgtcgatcg tctggatcgg cccggcgggc gtggctcttc gatcaccaag ggcgggcatc
2101 gattcgacct tgggccgacg atcgtgacgg tgcccgaccg gctgcgcgag ctttgggccg
2161 attgcgggcg cgatttcgac aaggacgtga gccttgtgcc gatggagccc ttctacacca
2221 tcgatttccc cgatggcgag aaatacaccg cttacggcga tgacgccaag gtcaaggccg
2281 aggtggcgcg gatcagcccc ggcgatgtcg agggcttccg ccatttcatg tgggacgcca
2341 aggcccgtta tgaattcggc tatgaaaacc tcggccgcaa gccgatgagc aagctgtggg
2401 acctgatcaa ggttctgccg actttcggct ggctgcgcgc cgaccgctcg gtctatggcc
2461 atgccaagaa gatggtgaag gacgaccacc tgcgcttcgc gctgtcgttc catccgcttt
2521 tcatcggcgg cgaccccttc catgtgacgt cgatgtatat cctcgtcagc cagctcgaaa
2581 agaaattcgg cgtgcattac gcgatcggcg gcgtgcaggc gattgccgat gcgatggcca
2641 aggtgatcac cgatcagggc ggcgagatgc gcctgaacac cgaggtcgac gagatcctgg
2701 tctcgcgtga cggcaaggcc acgggcatcc ggctgatgga cggcaccgag cttccggcgc
2761 aggttgtcgt ctcgaacgcc gatgcgggcc acacctacaa gcgtctcttg cgcaaccgcg
2821 accgctggcg ctggaccgac gagaagctcg acaagaagcg ctggtcgatg gggcttttcg
2881 tctggtattt cggcaccaag ggtacggcca agatgtggaa ggatgtgggt caccacaccg
2941 tcgtcgtcgg gccgcgctac aaggaacatg tgcaggacat cttcatcaag ggcgagctgg
3001 ccgaggacat gagcctttat gtccaccgtc cctcggtcac tgatccgacc gcggcgccga
3061 aaggcgacga caccttctac gtgctttcgc cggtgccgaa cctcggcttc gacaatggcg
3121 tggactggtc ggtcgaggcc gagaaataca aggccaaggt gctgaaagtg atcgaggaac
3181 ggctgcttcc gggggttgcc gaaaagatca ccgaggaagt ggtcttcacg ccggaaacct
3241 tccgcgaccg ttatctctcg ccgctgggcg cgggcttctc gctggaaccg cggatcctgc
3301 aatcggcctg gttccgcccg cataacgcct cggaagaggt ggacgggctt tatctggtcg
3361 gcgcgggcac ccatccgggc gccggtgtgc cctcggtgat cggctcgggc gagcttgtcg
3421 cgcagatgat cccggatgcg ccgaagcccg agacccccgc ggcggctgcg cccaaggccc
3481 ggacgccccg ggccaaggcg gcgcaatgat cgccgaagcg gatatggagg tctgccggga
3541 gctgatccgc accggcagct actccttcca tgcggcgtcc agagttctgc cggcgcgggt
3601 ccgtgacccc gcgctggcgc tttacgcctt ttgccgcgtc gccgatgacg aagtcgacga
3661 ggttggcgcg ccgcgcgaca aggctgcggc ggttttgaaa cttggcgacc ggctggagga
3721 catctatgcc ggtcgtccgc gcaatgcgcc ctcggatcgg gctttcgcgg cggtggtcga
3781 ggaattcgag atgccgcgcg aattgcccga ggcgctgctg gagggcttcg cctgggatgc
3841 cgaggggcgg tggtatcaca cgctttcgga cgtgcaggcc tattcggcgc gggtggcggc
3901 cgccgtcggc gcgatgatgt gcgtgctgat gcgggtgcgc aaccccgatg cgctggcgcg
3961 ggcctgcgat ctcggtcttg ccatgcagat gtcgaacatc gcccgcgacg tgggcgagga
4021 tgcccgggcg gggcggcttt tcctgccgac cgactggatg gtcgaggagg ggatcgatcc
4081 gcaggcgttc ctggccgatc cgcagcccac caagggcatc cgccgggtca ccgagcggtt
4141 gctgaaccgc gccgaccggc tttactggcg ggcggcgacg ggggtgcggc ttttgccctt
4201 tgactgccga ccggggatca tggccgcggg caagatctat gccgcgatcg gggccgaggt
4261 ggcgaaggcg aaatacgaca acatcacccg gcgtgcccac acgaccaagg gccgcaagct
4321 gtggctggtg gcgaattccg cgatgtcggc gacggcgacc tcgatgctgc cgctctcgcc
4381 gcgggtgcat gccaagcccg agcccgaagt ggcgcatctg gtcgatgccg ccgcgcatcg
4441 caacctgcat cccgaacggt ccgaggtgct gatctcggcg ctgatggcgc tgaaggcgcg
4501 cgaccgcggc ctggcgatgg attgaggccg cgacggttcg tcactggacg tgcgcgcgcc
4561 ccgggtccac aacggggcgt gtccacaacc ggaggccatg atgagcctga ctctctttgc
4621 cgtctatttc gtcgcctgcg cctgcgcggg cgcgaccgga gcgatcttca gccccggcgc
4681 atggtatgac agcctgaaga aaccgagctg ggtgccgccg aactggctgt ttcccgtcgc
4741 ctggtccacg ctttacatcc tgatgtcgat ttcggcggcg cgggtatccg ggctggcgat
4801 ggaaaacgaa ctggccgtgc tgggtctggc cttctgggcg gtgcagatcg cggtcaacac
4861 gctttggacg ccgatcttct tcggcctgca ccggctggcg ggcgggatgc tggttctggt
4921 gcttttgtgg ctgagcgtct ttgccacctg cgtcctgttc tggagcgtcg actggctctc
4981 ggggctgatg ttcgtgccct atgtgatctg ggtgacggtg gccggggcgc tgaatttcag
5041 cgtctggcgg ctcaatccgg gcgaaaagcc gatcacgctt tgatctgctg atcctggtgt
5101 gaccgaggcc atcgcctcgg tcatgcgcaa cctttccccg gggccgccga gggcccaaag
5161 gcactcggcc agcgcccgcc ccgcccatgg cgggcgcttc tcgcccgccg ggacgggcgc
5221 tggcctctgt tgtgcttggg gaggcggtct tgcggatgcg tcgcgcggcg ggcggggaaa
5281 agcggtgctt ttcctgatcc gcccgaaacg gacgatttca gtccttgtgc ttccacccgg
5341 cccggcgcgg cacccgcacg gccagcatcg gcttcagcac cggctggcgg aaccgcacca
5401 gatcgagcgc ctcgtgcatg cccaccgttt cctcgccgtc gatctgcgtg cgcatcagcg
5461 cgcgggaata gaagggcgcc tccagcagcg aatagacccc ggccggatgc gcccccgcat
5521 cgcagcgcgt ttcacgccgc acgaaccagg gcgtgcgctt catcttcacc aggggcggag
5581 ggctgtcgat ctcgcagacc gacccgtccg ggttgatctg cacggcgagc gccaccttgg
5641 tccggtccaa ccgcgtcgca tcgtaaaaac agaccgttcg atccttcagc gggaaccgcc
5701 cccaggtcca gaacgagaag tcttcctcca gcgcgcgggt gccgaaattg gcgtcgaaat
5761 agccgtgtcc ggtccatttg tgcccgggcg ccagatcgac ctcgacatcg gcgatgggcg
5821 cgaaaggccg ccaggtgtgg cccgcatccg gcgtcagccg cacctcgacc ccggtcaccg
5881 cgcgcggcgt cagcacgacc cggcccttga gcttgcccag cttcggcaaa gccccccatt
5941 cgtcaacgtc gatcaccagc tccttgccgg tccaggtcag cttcgacggc ccgacctgaa
6001 agctgtcgcg cgactgccgc agcgcagacc ggccccggtc ggtcatcgtg aagcgcccat
6061 cagtgccggt cgtcaccatg ttgatgcagc agtgattttg cggctcgcga cgacccgacc
6121 agcgatacca cgggctgaag accgagccga tgaaggcgat catcgagaag gccttttcgc
6181 cgtcgtcgga gatgccgtcg caataccacc aggcgtagcc gttcggccct acttcgcggc
6241 ggaaatccgg tccgccagca aggcccgtgc tgcatgcgtc ccggaagtca gcgccatggg
6301 caccccggca cccgggtgcg ttcccccacc cgccagataa agccccttga gccccgtccg
6361 cgccagaggc cgccggaagg tcgccagcgt cccctccggc gagccgccgt agatcgcccc
6421 cagcgagccc gggaaccgcc gcgacagcag cgccggcgtc gtcagcgccc gggtttccgg
6481 gtccgggctg aaggtcagtc ccatcgcggc gagcatcggg aaggtgcgtg ccctgcattg
6541 tgcctcctcc tgaggaaagg gttgatggcc tgccgggccg ttcatgatga tttcaaagcg
6601 ttcgatttcc ggcaccgggg cctgcatctc gcggtcctgc gcacagatgt aaagcgtcgg
6661 ttcctcgggc atctcgcccg cgccgatggg cccgaattcc agctccggat cggcggtgaa
6721 gaagacgttg tgatgcgcga ggtcaacgcc gatcggggtg gcgccaaagg cccagaccca
6781 ggcggaaaga ctgggggcgg ggcggggcga tttctccatc gaggcgcggg cggcatcgcc
6841 cagaagcccg tcgcgcaacg cccccggatc gccgttgaag atgcaggcgc cgcaggggat
6901 cgagacgccg gtctcgatct cgaccgcggt gacccggcct tccttgcgca cgatgcgctt
6961 ggccttggcg ccgtaatgga accgcacgcc cttcgcctcg gccacccggg ccagcgcggc
7021 ggcaacgcca tgcatcccct cgcggatcgc ccagacgccc tgaacctcgg cctgccagat
7081 cagcgagagc accgcgggcg tcgcccccgg acggccgccg acataggtcg catagcggcc
7141 gaagagctgc gccagccgcg gatccttgaa gtgatgggcg agcaggtcgc gcatcgtcag
7201 cccggggcgc agcgcgggcc aaagctgcgg ccgcgtcacc gtggccgcgg cgattcgcca
7261 aagatccggc ttcggcgccg cgatcaccga gcggtggaac gcctcccaca gccccgtcgt
7321 caggtgatcg aagcggcgga atgccgcagc ttccttgtcg cccgcaaagg cccggatcgc
7381 ctcgatattg gcttcggtgt cggtgaacag atcgaggctg gagccgtcgg gccagaaatg
7441 ccgggcgagt cggggcaggg ggatcagcgt cagatgttcc tcggcccggg tgccgcaggc
7501 ggcaaagagc gcgtccagaa catgccgcat cgtcaagacc gtgggccccg tatccgcagg
7561 cccgccgggg gtcggaaccg cccgcgcctt gccgcccggg gcatcgcccg cctcaaccac
7621 cgtgacccga aggcccgccg ccgccgcacc gatggcggcc gcaaggcccc ccattctggc
7681 gccgatcacc acgacgtccg tttcactccg catcgctcgc tcccgcacgc accgccgtcc
7741 gcaccccccc gggagtgcaa ttttgtccca tattcttggg tgtaagtttc agtttacaca
7801 ggtaggtgcg aatgccaatg tgcgtcgtga cgcagcggag ggctctgtca tgtctctgga
7861 taaacgtatc gagtcggcgc tggtcaaggc gctgtcaccc gaggctttgg gtgaatctcc
7921 gccgttgctt gccgccgcgc tgccttacgg ggtgtttccc ggcggcgcgc ggatccggcc
7981 gacgatcctt gtctcggtcg cgctcgcctg tggcgacgat tgcccggcgg tcaccgatgc
8041 cgcggccgtg gcgctggagc tgatgcattg cgcgagcctc gtgcatgacg atctgcccgc
8101 cttcgacaat gccgacatcc ggcgcggcaa gccgagcctt cacaaggcct ataatgaacc
8161 gcttgcggtt ctggcgggcg acagcctgct gatccgcggc ttcgaagtgc tggccgatgt
8221 cggcgccgtc aacccggacc gggcgctgaa gctgatctcg aaactgggtc agctgtcggg
8281 ggcgcgcggc gggatctgcg ccggtcaggc ctgggaaagc gaatccaagg tcgatctggc
8341 cgcctatcat caggcgaaga ccggggcgct gttcattgcc gcgacccaga tgggggcgat
8401 tgcggcgggc tacgaggccg aaccctggtt cgatctgggc atgcggatcg gctcggcctt
8461 ccagatcgcc gacgacctga aagacgcgct gatgtcggcc gaggcaatgg gcaagcccgc
8521 cgggcaggac atcgcgaacg aacgcccgaa tgcggtcaag acgatgggca tcgagggcgc
8581 gcgcaaacat ctgcaagatg tgctggcggg ggcgatcgcc tcgatcccgt cctgccccgg
8641 tgaggcgaag ctggcccaga tggtgcagct ttacgcccac aagatcatgg acatcccggc
8701 cagcgccgag aggggctgat cgtgccgaag gacgaccaca cgggcgcgac ggccgaccgg
8761 accgcgcagc cgacaggaac gggaaagcag ccgctggttc cgggccagcc cggggcggcg
8821 ccggtgcagc cggggcgggt gaatttcttc acccggatcg cgctgtcgca acggctgcat
8881 gaaatcttcg aacgcctgcc gctgatgaac cgcgtcaccc ggcgcgaggg cgaggcgctc
8941 ttcgacatcg tttcgggctt cgtgcaaagc caggttctct tggcgatcgt cgaattccgg
9001 gtgctgcata ttctggccgg ggcctcttgg cccttgccgc aactggccga acgcaccggc
9061 ctggccgagg accggctggc ggtgctgatg caggccgccg ccgccttgaa gctggtgaaa
9121 ttccgccgcg gtctgtggca gcttgccccg cgtggcgccg ccttcatcac cgtgccaggg
9181 ctcgaggcga tggtgcgcca tcaccccgtc ctttaccgcg atctggccga tccggtggct
9241 tttctgaaag gcgacatcga acccgagctg gcgggcttct ggccctatgt cttcgggccg
9301 ctggcgcagg aagatgcggg gctcgccgag cgctattcgc agctgatggc cgacagccag
9361 cgcgtcgtgg ccgatgacac cttgcggctt gtcgatctgc gcgatgccaa gcgggtgatg
9421 gatgtgggcg gcggcaccgg ggccttcctg cgcgtcgtgg ccaagcttta ccccgagctg
9481 cccttgacgc tgttcgacct gccgcatgtg ctgtcggtgg cggaccgctt cagcccgaag
9541 ctcgatttcg cgccgggcag cttccgcgac gatccgatcc cgcagggcgc cgatgtcatc
9601 actttggtgc gcgtgctgta tgaccatcct gacagcgtcg tcgaaccgct tctggccaag
9661 gtgcatgccg ccttgccgcc gggcgggcgt ctgatcatct cggaggcgat ggcggggggc
9721 gcaaaacccg accgtgcctg cgatgtctat ttcgccttct acacgatggc gatgagttcg
9781 gggcgcacgc gttcccccga agagatcaag caaatgcttg aaaaagctgg gttcaccaag
9841 gtgtcgaaac cgcggaccct gcgccccttc atcacctcgg tgatcgaggc cgaacgcggc
9901 tgacacggct gcgttcggac ccggctttga cccgggggtc agaaagtcgc acatccgtct
9961 gtcgcaaaag tgtctaatca aattgacagt cgggcgtgta agttcaatga tacacacagg
10021 cgtgatcagc ccgactctcc ggcccgatca taccgggagc aagaaatg
//