GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS       ATHRPII      9235 bp ds-DNA             PLN       25-MAY-1991
DEFINITION  Arabidopsis thaliana rpII gene for RNA polymerase large subunit
            (EC 2.7.7.6)
ACCESSION   X52954
KEYWORDS    RNA polymerase; RNA polymerase II; polymerase; rpII gene.
SOURCE      Arabidopsis thaliana DNA.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
            Dilleniidae; Capparales; Brassicaceae.
REFERENCE   1  (bases 1 to 9235)
  AUTHORS   Nawrath,C.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 9235)
  AUTHORS   Nawrath,C., Schell,J. and Koncz,C.
  TITLE     Homologous domains of the largest subunit of eukaryotic RNA
            polymerase II are conserved in plants
  JOURNAL   Mol. Gen. Genet. 223, 65-75 (1990)
  STANDARD  full automatic
COMMENT     *source: strain=Columbus; library=EMBL4, lambda gt11;
            Data kindly reviewed (25-OCT-1990) by Nawrath C.
            From EMBL entry ATRPII; dated 19-NOV-1990.
FEATURES             Location/Qualifiers
     mRNA            <1101..1100
                     /note="exon 1"
     CDS             1017..1100
                     /note="RNA polymerase II large subunit (AA 1-28)"
                     /codon_start=1017
     intron          1101..1290
                     /note="intron I"
     mRNA            1291..1575
                     /note="exon 2"
     CDS             1291..1575
                     /note="RNA polymerase II large subunit (AA 29-123)"
                     /codon_start=1291
     intron          1576..1663
                     /note="intron II"
     mRNA            1664..1948
                     /note="exon 3"
     CDS             1664..1948
                     /note="RNA polymerase II large subunit (AA 124-218)"
                     /codon_start=1664
     intron          1949..2025
                     /note="intron III"
     mRNA            2026..2184
                     /note="exon 4"
     CDS             2026..2184
                     /note="RNA polymerase II large subunit (AA 219-271)"
                     /codon_start=2026
     intron          2185..2269
                     /note="intron IV"
     mRNA            2270..2431
                     /note="exon 5"
     CDS             2270..2431
                     /note="RNA polymerase II large subunit (AA 272-325)"
                     /codon_start=2270
     intron          2432..2513
                     /note="intron V"
     mRNA            2514..2769
                     /note="exon 6"
     CDS             2514..2769
                     /note="RNA polymerase II large subunit (AA 326-410)
                     (2769 is 1st base in codon)"
                     /codon_start=2514
     intron          2770..2861
                     /note="intron VI"
     mRNA            2862..2950
                     /note="exon 7"
     CDS             2862..2950
                     /note="RNA polymerase II large subunit (AA 411-440)
                     (2862 is 2nd base in codon)"
                     /codon_start=2862
     intron          2951..3035
                     /note="intron VII"
     mRNA            3036..3371
                     /note="exon 8"
     CDS             3036..3371
                     /note="RNA polymerase II large subunit (AA 441-552)"
                     /codon_start=3036
     intron          3372..3465
                     /note="intron VIII"
     mRNA            3466..3752
                     /note="exon 9"
     CDS             3466..3752
                     /note="RNA polymerase II large subunit (AA 553-648)
                     (3752 is 2nd base in codon)"
                     /codon_start=3466
     intron          3753..3928
                     /note="intron IX"
     mRNA            3929..4187
                     /note="exon 10"
     CDS             3929..4187
                     /note="RNA polymerase II large subunit (AA 649-734)
                     (3929 is 3rd base in codon)"
                     /codon_start=3929
     intron          4188..4272
                     /note="intron X"
     mRNA            4273..7355
                     /note="exon 11"
     CDS             4273..7355
                     /note="RNA polymerase II large subunit (AA 735-1762)
                     (7355 is 2nd base in codon)"
                     /codon_start=4273
     intron          7356..7466
                     /note="intron XI"
     mRNA            7467..7538
                     /note="exon 12"
     CDS             7467..7538
                     /note="RNA polymerase II large subunit (AA 1763-1786)
                     (7467 is 3rd base in codon) (7538 is 2nd base in codon)"
                     /codon_start=7467
     intron          7539..7645
                     /note="intron XII"
     mRNA            7646..7846
                     /note="exon 13"
     CDS             7646..7811
                     /note="RNA polymerase II large subunit (AA 1787-1841)
                     (7646 is 3rd base in codon)"
                     /codon_start=7646
     intron          7847..7934
                     /note="intron XIII"
     mRNA            7935..8251
                     /note="exon 14"
BASE COUNT     2597 a   1839 c   1951 g   2848 t
ORIGIN
        1 tctagatagt ttgaagaatc ctattgagcg atcttaatca aaagaacaaa agaagttata
       61 acgaaaacat gcatttgaag ataaaagcta taaagtataa ataattttct ttcttttttc
      121 gttgttttgt ttttgtatca gcgtgtgtgt tatatacata acaaagtggg cttcacataa
      181 cttgagacta tgcctccgta cgtgtaatat ttatttgtta gttttccgta ttataataat
      241 gggttcaccc tactttgcta gagtagaaga ataatttcat tttattaact cccaactcat
      301 tcattactca cttttgggat attgaagaag ttaggtattt actattccga agatgaaatg
      361 aagataatta atcgtttcaa tctaaattac acattagatt tcttttagcg tagggactcc
      421 tcttccttta gagcttcggc cactagcagt taaaacttta atggttcaga tttttcagtc
      481 cactttacca aacatggtga aataagggga actattatcg ttttgctctt ttaagaagaa
      541 atctatttga cggaaaataa gaagaaatct tatttttgat atatcggttt gaccaactga
      601 accgatataa accaatctta ctagcattct tcgttacaac acgtttcttt ggtcagtgcc
      661 gacgcaggcc cgttaatccg tttaactggg cctgagacag ttacagccca attaaccaga
      721 acagaccttt gagccgccta tttttctata aatagaggct tagggcatgg atcgtgtttc
      781 ttttgctcat tcatctctcc tcgtgaagag acgagagaat tccagatctg gtcgtgtcaa
      841 aacccagaga gggaaaaaaa aaacggaaaa aagtttagct caagtaactg taaactactg
      901 actcacagat tcaaacaaat tggcaaaaga gggtttaatt aatttttgat tcccctttta
      961 attcacgttc tagggttttc gatttttgat tcgtctttga tcggagctta gccgccatgg
     1021 atacgaggtt tccgttctct ccggccgagg tctctaaagt ccgggtggtc cagtttggca
     1081 tactcagccc cgatgagatc gtaatgctcc tttctctttc tctttctctt tcaggtccat
     1141 tcgtccgccg tctgtttagc tggcggggat attggcatga atagtagatt ggtttagtta
     1201 gggagcatgt tcttttctat ctattgatct taattggaag cttcaagtaa gaaattgagt
     1261 ttcactggtt gctaatggct tttttttcag aggcaaatgt ctgttataca tgttgagcat
     1321 agtgagacga ccgagaaggg taaacctaag gtgggaggat tgagtgatac ccgtcttggt
     1381 acgattgatc ggaaggtgaa gtgtgagaca tgtatggcta atatggctga gtgtccggga
     1441 cattttggct atcttgagct cgctaagcca atgtatcatg tcggttttat gaagacagtg
     1501 ttaagtatca tgagatgtgt ctgtttcaat tgctccaaga ttttagctga tgaggtatgt
     1561 aggagcttat ttcgtgtcag cattgcttgt tttgcttatg tctttgtttg acgtagctgt
     1621 tcttcaattg attattttta tgaaggagga gcataagttt aagcaggcta tgaagatcaa
     1681 gaatcctaag aataggctta agaagattct ggatgcctgc aaaaacaaga ccaaatgtga
     1741 tggtggtgat gacattgacg atgtccaaag ccacagcacg gatgaaccag taaaaaagag
     1801 ccgaggtgga tgtggtgcac aacaaccaaa actgactatt gagggtatga agatgattgc
     1861 agaatacaaa attcaaagga agaaaaatga tgagccagat cagcttcccg agcctgcaga
     1921 aaggaaacag acacttggtg ctgatagggt tcgtcttttc tttcgaatga attttacctc
     1981 ttgttcttgt ttggcgtaat ctgatggcat gtgatattct tgcaggtttt gagtgttttg
     2041 aaaaggatta gtgacgcgga ttgtcaactc ctaggtttca accctaagtt tgctcgtcct
     2101 gactggatga ttcttgaagt ccttcctatt cctccacccc ctgtcagacc atctgtaatg
     2161 atggacgcca cttccaggag tgaggtaagt cgactacggt tttctgaata aaacttttct
     2221 tagaccaaca tagtgtgtgc ccctgagttt atgcttattg ttgatgcagg atgacttgac
     2281 ccatcagcta gctatgatta ttcgacacaa tgaaaacttg aaaaggcagg aaaaaaatgg
     2341 agcgccagct catattatat cagagtttac acaactcttg cagtttcata tagctacgta
     2401 ttttgataac gagttgcctg gacagccaag ggtaatgctt ttgtatcctt ccagatatct
     2461 ggtatgatac atacggatat tgctttatct ggattggctt ttattttctt caggctactc
     2521 agaaatcagg gaggcctatt aaatcaatat gtagtaggct gaaggcaaag gaaggcagaa
     2581 tcaggggtaa cttgatggga aaacgtgttg atttctcggc acgtactgtt attactccag
     2641 atccaacaat aaatattgat gaacttggtg ttccgtggag tattgctctg aatctcacat
     2701 acccagaaac agttactccc tataacattg aaaggttagt acgtctggtg tttatttcat
     2761 tttctgagag taatttttcc gtgttttact aacaatttag tttggcagat taaaggagct
     2821 tgttgattat ggaccacatc ctccacctgg gaagactgga gcgaaatata tcataagaga
     2881 tgatggccaa agatcagatc ttcggtatct taagaagagc agtgatcaac atttggaact
     2941 tggatacaag gtatgtacta ctttcttatt ctatccactc aacgcacaaa aggttatttc
     3001 tggaagtcgt catttttatg ctttcttggt cacaggtgga gcggcattta caggatggtg
     3061 attttgttct gtttaatcgt caaccaagtc tgcacaaaat gtctatcatg ggtcacagga
     3121 ttaggattat gccatattcc actttccgtc tgaatttgtc tgtcacgtct ccgtacaatg
     3181 ctgattttga tggggatgag atgaacatgc atgtaccaca atcattcgag accagagccg
     3241 aggtgttaga gctgatgatg gttcctaaat gtattgtctc cccccaggcg aatcgtcctg
     3301 tgatgggaat tgtgcaggat accctcttgg ggtgccgtaa aattacaaag agagatactt
     3361 tcatagagaa ggtagcaatt tcatcagcgg ttgcttttta aagttatcta tgatattgat
     3421 gagaggataa accaactctt atgctgatat cttatccatt ttcaggatgt attcatgaac
     3481 acactgatgt ggtgggaaga cttcgatggg aaagttccgg ctcctgcaat cttgaagcct
     3541 cgtcctcttt ggactggcaa acaagttttt aatcttatca taccaaaaca gataaatctg
     3601 ttgaggtact ctgcttggca cgcagataca gagactggat ttataactcc gggggatact
     3661 caagtgcgaa ttgaaagagg ggaacttctt gccggaactc tttgcaaaaa gacccttggt
     3721 acatctaatg gaagtctcgt gcatgtcatt tggtaagttg cacatcaaga ttttagagtg
     3781 tttattccgg tttgtcacat ttgcttactg gctaaatttt ggtaccttag ttaacttcac
     3841 atcacatgca gcattgttgg gaatatctaa atgtagtaat tagtttcatg agctcttttg
     3901 ttttaattca tttttacttg gcttccaggg aagaggttgg tcctgatgca gctagaaaat
     3961 tcctcggtca tactcaatgg cttgtcaatt actggcttct gcagaatggt tttaccatcg
     4021 gaattggtga cacaattgcc gattcatcaa caatggagaa aattaatgaa actatttcca
     4081 atgcaaaaac tgctgtgaaa gatcttatcc ggcagttcca gggaaaggaa ttggaccctg
     4141 agcctggccg aactatgaga gatacatttg agaacagggt tgaccaggta tattcaagac
     4201 aataattagt tgattttcgt ttggttgtca gtgcagtttt tatatgtgct gactcataaa
     4261 ctttccttgt aggttttgaa taaagctcgt gatgatgctg gaagtagtgc tcaaaagagt
     4321 ttagcagaaa ccaataacct taaggccatg gtgacagcag gatccaaagg aagtttcatc
     4381 aatatttctc aaatgacagc gtgtgtcggt cagcaaaatg ttgaagggaa gcgaattcca
     4441 tttggatttg atgggcggac attgccacat ttcaccaaag atgattatgg tcctgaaagt
     4501 cgtggttttg ttgagaattc gtacctgcgt ggcttgactc ctcaagagtt ctttttccat
     4561 gctatgggag gacgagaagg tcttattgat actgctgtga agacatcaga aactggatac
     4621 attcagaggc gattggtaaa ggctatggag gatattatgg ttaagtatga tgggacagtc
     4681 agaaactctt tgggtgatgt tattcaattt ctctatggag aagatggtat ggatgctgta
     4741 tggatagaat cacagaagct ggattccttg aaaatgaaga aatcagagtt tgataggact
     4801 tttaagtatg agattgacga cgaaaactgg aatcctactt acctaagtga tgaacatctt
     4861 gaagacttga aggggattcg ggagttgcgt gatgtattcg atgcggaata ttcgaaactt
     4921 gagactgaca gattccaact cgggacagaa attgcaacaa atggtgatag cacttggcca
     4981 ttgcctgtta acatcaagag gcatatctgg aatgcgcaga agactttcaa aattgacttg
     5041 cgcaaaattt cagatatgca ccctgttgaa attgttgatg ctgttgataa actacaggag
     5101 aggctgttgg ttgttcctgg tgatgatgcg ttgagtgtgg aagcacagaa aaacgcaaca
     5161 ttgttcttta acattttgct tcgcagcact cttgctagta aaagagtgtt ggaagaatac
     5221 aagctcagcc gcgagcgttt tgagtgggtc attggtgaga ttgaatcaag gtttttacaa
     5281 tcgctagtgg ccccagggga aatgatcggt tgtgttgctg ctcaatcaat tggagaacct
     5341 gctacgcaga tgactctgaa taccttccat tatgctggtg tcagtgcaaa gaacgttacg
     5401 ctcggagttc ccaggttgcg tgaaattatt aatgtagcta agaggatcaa aacaccatcc
     5461 ctatcagtct atctcactcc ggaagctagc aaatcaaaag agggggctaa gactgttcag
     5521 tgtgctttgg agtatactac tctcaggagt gttactcaag ctacggaagt ctggtatgac
     5581 ccagatccaa tgagtacaat aattgaagag gactttgaat ttgtgaggtc ctactatgaa
     5641 atgccagatg aagatgtttc cccagataag atatctccgt ggctacttcg tatagagttg
     5701 aatcgcgaga tgatggttga taagaaattg agtatggcgg atattgcgga gaagatcaac
     5761 cttgagttcg atgatgacct aacttgcata ttcaatgatg ataatgctca aaaactgatc
     5821 cttcgaattc gcattatgaa cgatgagggc ccaaagggag agttgcaaga tgaatcggct
     5881 gaagatgatg ttttcctcaa aaagattgag agcaacatgc tgacagaaat ggcactcaga
     5941 ggtattccag acatcaacaa ggtttttata aaacaggtta gaaagagcag gtttgatgag
     6001 gagggaggct tcaagacatc tgaggagtgg atgttggata cagaaggtgt gaacctctta
     6061 gctgtcatgt gtcacgaaga tgtggatcca aagaggacaa caagcaatca cttgattgaa
     6121 attattgaag ttctcggaat tgaggcagtt cgtcgtgctt tgcttgatga actccgtgtt
     6181 gtgatatcct ttgatggttc ttatgtgaat taccgtcatc ttgccatctt gtgtgatact
     6241 atgacctatc gcggtcatct gatggctatc actcgacacg gtatcaatag aaatgacact
     6301 gggcctctga tgagatgctc ttttgaagaa acagttgata ttctgctaga tgctgcggct
     6361 tatgctgaga cagactgctt acggggtgtt actgagaata taatgttggg tcaacttgca
     6421 ccaattggga caggagattg tgagttgtat ctgaatgatg agatgctgaa gaatgcaatt
     6481 gaacttcagc tccctagcta tatggatggt cttgaatttg gaatgactcc tgctcgttca
     6541 ccagtgtcag gcactcctta ccatgaaggc atgatgtctc caaactacct gttaagtcca
     6601 aatatgcgtt tatccccaat gtcagatgca cagttttctc catatgttgg tggaatggcc
     6661 ttttcgcctt cttcttctcc aggatatagt ccatcatcgc ctggatacag tcctacttct
     6721 cccggttaca gtccaacttc gcctggatat agcccgactt ctcccggtta cagtccaact
     6781 tcgcctacct acagtcccag ttctcctggc tatagcccaa caagccctgc ttattctcct
     6841 acaagtcctt cctattctcc tacctctccg agctacagcc caacgtctcc aagctatagc
     6901 ccaacgtcgc caagctacag cccgacatct ccgagctaca gtcctacttc cccaagttac
     6961 agcccgactt cgcctgctta cagcccgact tcacctgctt acagcccaac ttcaccagca
     7021 tacagcccaa cctctccttc ttacagccca acttcacctt cttacagccc aacatcgcct
     7081 tcttacagcc ctacttcacc atcttacagc ccaacatctc cgtcttacag ccctacttca
     7141 cccgcatata gccccacatc tcctggctac agccctactt caccaagtta cagtccaaca
     7201 tcaccaagtt acagtccaac atcaccaagc tacggtccta cgtctccaag ctacaaccct
     7261 cagtctgcta aatatagccc atctatagct tactctccta gcaatgcaag actatcacca
     7321 gctagcccct acagtcctac atctcccaac tacaggtaag tagtaatatc ttaattttta
     7381 cactttctat gaagcttctc ttatcttgtg gtgatattgt gtcatcttct ctaatcgtta
     7441 actcttatta aaaatttact ctgcagcccg acatctccat catactcacc cacatctcca
     7501 tcttattcac cttcaagtcc aacatacagt cccagcaggt ttgactttta cccgcttgaa
     7561 aatcttggat ctgtttgtaa tgcatccatc tcaattgaat tccgcaaaaa gtttaactgg
     7621 gttgttgtta ttaaatctgt ctcagcccat acagctcagg agcaagccca gactacagcc
     7681 caagcgcagg ctactcgcca acacttcccg gttattcacc gtcatcaacg ggtcagtata
     7741 ccccacatga gggcgataaa aaggacaaga ctggaaaaaa agatgccagt aaggatgata
     7801 aaggcaaccc ttgaaagaga gtgagaattg gcaatccatg ttttaggtaa cgtctaaatc
     7861 tcttggaacc attggggttt acaagtctta ctagctcacg aacttagcca cttgggatat
     7921 cgtgtaaccc gcaggtcatg atttaagaga cagtgaaagc tgagaaaagg aagggaccgt
     7981 ttcaaagtga tcttctgtgg ataactttgt gaacaaggtt ttcttaatag atcctttttt
     8041 cgtgagttgt atattattcc aaactgatcc ataaatccat ccaatcctca tcccccaaaa
     8101 agaaaatagt ataaacatag aaaaacgaaa acatatcaga ctttgggctt ttcgtatggt
     8161 ttagtttttg gctttttgct gtgttgttct atttcagaaa gtaaacatgt gaaacggttc
     8221 atttgtaatc catgaaagga ttctttatgt tactgctgtt gcttcattga gtagatacga
     8281 atcgagaatg ccttttttcc ttgtttccga caattatcga ttgacgtgtg accactttaa
     8341 aagtttaaac agctcgactt tccaatatgg gtttatttct tgttttatcc acaccattaa
     8401 agaatggttt ttgggatttt tatttatgtg ataattaatc atttttccaa attttatttt
     8461 gtatataata aatgaagcaa atgttggaaa actatccaat ggatgtggtg ggttaatatc
     8521 accagattcg catagctggt tttttgactt gtcttcttaa ttatttgtcc agaaaaagag
     8581 aagaactctt cacatcatca ttgtcaactt tagcattatt gtattagctt tttatttctt
     8641 tacgtctaca aagctattgg tacaacgttc taaaatcaaa ttcgtcatca gtagattttg
     8701 taaactaatt aagtaaagtt cagtgattaa agaagctaga tgaagaacgt gtgcaacgac
     8761 tcctctgaga tctacacgga ataatgtcgt cagtgagcaa acaactccca tcacgtcgtc
     8821 cctccacctg tcctctcttc tccttccttg cttgtctttc tctctcaaat catttcacct
     8881 aaaaataata aatatctttc gttttctaaa gaaaaaaaaa cttttcaaat tcatctttgg
     8941 tttctgcagg caggcaaaca aacaagccct gttcgttagg gttttcggtt tttgttagct
     9001 tttcttcttc ttcttcttcg tttccttcca cctgaattgt tgtaaccatg gcggagaagt
     9061 atcgaggcac gttatgcttc tacacggtga cctcgatttg aaaattgtta aggctaggag
     9121 gttacctaac atggatatgt tctcagaaca tttgcgccgt ctcttcaccg cctgtaacgc
     9181 ctgtgctaga cccaccgata ccgatgatgt cgatcccaga gataaaggcg aattc
//
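The coordinates in the feature table above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original entry: the variable and function names are illustrative assumptions, and the numbers are copied by hand from the CDS lines, the BASE COUNT line, and the LOCUS line of this record. It checks that the thirteen CDS segments together encode the 1841 residues named in the last /note, and that the base counts add up to the locus length. Individual segments need not be multiples of three, because several codons are split across introns, as the /note qualifiers indicate.

```python
# Values below are transcribed from the ATHRPII (X52954) record; the names
# CDS_SEGMENTS, BASE_COUNT, etc. are hypothetical, chosen for this sketch.

# (start, end) of each CDS segment, 1-based and inclusive, exons 1 to 13.
CDS_SEGMENTS = [
    (1017, 1100), (1291, 1575), (1664, 1948), (2026, 2184),
    (2270, 2431), (2514, 2769), (2862, 2950), (3036, 3371),
    (3466, 3752), (3929, 4187), (4273, 7355), (7467, 7538),
    (7646, 7811),
]

# BASE COUNT line and LOCUS length of the record.
BASE_COUNT = {"a": 2597, "c": 1839, "g": 1951, "t": 2848}
LOCUS_LENGTH = 9235

# Last amino-acid position named in the final CDS /note (AA 1787-1841).
LAST_RESIDUE = 1841


def segment_length(start: int, end: int) -> int:
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def total_coding_length(segments) -> int:
    """Sum of the lengths of all CDS segments."""
    return sum(segment_length(start, end) for start, end in segments)


coding_bases = total_coding_length(CDS_SEGMENTS)
codons, remainder = divmod(coding_bases, 3)

print("coding bases:", coding_bases)
print("codons:", codons, "remainder:", remainder)
print("base count total:", sum(BASE_COUNT.values()))
```

If the annotated segments are consistent, the remainder is zero, the codon count equals `LAST_RESIDUE`, and the base-count total equals `LOCUS_LENGTH`.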