GenBank-Updates@genbank.bio.net (05/25/91)
LOCUS ATHRPB1 8050 bp ds-DNA PLN 25-MAY-1991
DEFINITION Arabidopsis thaliana RpII205 gene for the largest subunit of RNA
polymerase II (EC 2.7.7.6).
ACCESSION X52494
KEYWORDS RNA polymerase; RNA polymerase II.
SOURCE Arabidopsis thaliana DNA.
ORGANISM Arabidopsis thaliana
Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
Dilleniidae; Capparales; Brassicaceae.
REFERENCE 1 (bases 1 to 8050)
AUTHORS Dietrich,M.A.
JOURNAL Unpublished (1990)
STANDARD full automatic
REFERENCE 2 (sites)
AUTHORS Dietrich,M.A., Prenger,J. and Guilfoyle,T.J.
TITLE Analysis of the genes encoding the largest subunit of RNA
polymerase II in Arabidopsis and soybean
JOURNAL Plant Mol. Biol. 15, 207-223 (1990)
STANDARD full staff_review
COMMENT *source: strain=var.Columbia; library=EMBL3 and pOCA18 genomics;
From EMBL entry ATRPB1; dated 21-DEC-1990.
FEATURES Location/Qualifiers
mat_peptide join(863..946,1136..1399,1512..1793,1871..2029,2115..2276,
2359..2579,2654..3215,3310..3596,3773..4031,4117..7178,
7292..7363,7503..7635)
/EC_number="2.7.7.6"
/note="largest subunit of RNA polymerase II"
/product="DNA-directed RNA polymerase"
/gene="RpII205"
mRNA join(484..946,1136..1399,1512..1793,1871..2029,2115..2276,
2359..2579,2654..3215,3310..3596,3773..4031,4117..7178,
7292..7363,7503..>7638)
/gene="RpII205"
CDS join(863..946,1136..1399,1512..1793,1871..2029,2115..2276,
2359..2579,2654..3215,3310..3596,3773..4031,4117..7178,
7292..7363,7503..7638)
/EC_number="2.7.7.6"
/note="largest subunit of RNA polymerase II"
/product="DNA-directed RNA polymerase"
/gene="RpII205"
/codon_start=863
TATA_signal 452..457
/note="putative"
exon 484..946
/number=1
prim_transcript 484..>7638
/gene="RpII205"
exon 484..946
intron 947..1135
/number=1
exon 1136..1399
/number=2
exon 1136..1399
exon 1136..1399
exon 1136..1399
intron 1400..1511
/number=2
exon 1512..1793
/number=3
exon 1512..1793
exon 1512..1793
exon 1512..1793
intron 1794..1870
/number=3
exon 1871..2029
/number=4
exon 1871..2029
exon 1871..2029
exon 1871..2029
intron 2030..2114
/number=4
exon 2115..2276
/number=5
exon 2115..2276
exon 2115..2276
exon 2115..2276
intron 2277..2358
/number=5
exon 2359..2579
/number=6
exon 2359..2579
exon 2359..2579
exon 2359..2579
intron 2580..2653
/number=6
exon 2654..3215
/number=7
exon 2654..3215
exon 2654..3215
exon 2654..3215
intron 3216..3309
/number=7
exon 3310..3596
/number=8
exon 3310..3596
exon 3310..3596
exon 3310..3596
intron 3597..3772
/number=8
exon 3773..4031
/number=9
exon 3773..4031
exon 3773..4031
exon 3773..4031
intron 4032..4116
/number=9
exon 4117..7178
/number=10
exon 4117..7178
exon 4117..7178
exon 4117..7178
intron 7179..7291
/number=10
exon 7292..7363
/number=11
exon 7292..7363
exon 7292..7363
exon 7292..7363
intron 7364..7502
/number=11
exon 7503..>7638
/number=12
exon 7503..>7638
intron 7671..7758
/note="differentially spliced intron in 3' untranslated
region"
polyA_signal 7824..7828
/note="putative"
polyA_signal 7895..7899
/note="putative"
polyA_signal 7934..7939
/note="putative"
BASE COUNT 2255 a 1618 c 1751 g 2426 t
ORIGIN
1 acataacaaa gtgggcttca cataacttga gactatgcct ccgtacgtgt aatatttatt
61 tgttagtttt ccgtattata ataatgggtt caccctactt tgctagagta gaagaataat
121 ttcattttat taactcccaa ctcattcatt actcactttt gggatattga agaagttagg
181 tatttactat tccgaagatg aaatgaagat aattaatcgt ttcaatctaa attacacatt
241 agatttcttt tagcgtaggg actcctcttc ctttagagct tcggccacta gcagttaaaa
301 ctttaatggt tcagattttt cagtccactt taccaaacat ggtgaaataa gggaacctat
361 tatcgttttg cctcttttaa gaagaaatct atttgacgga aaataagaag aaatcttatt
421 tttgatatat cggtttgacc aactgaaccg atataaacca atcttactag cattcttcgt
481 tacaacacgt ttctttggtc agtgccgacg caggcccgtt aatccgttta actgggcctg
541 agacagttac agcccaatta accagaacag acctttgagc cgcctatttt tctataaata
601 gaggcttagg gcatggatcg tgtttctttt gctcattcat ctctcctcgt gaagagacga
661 gagaattcca gatctggtcg tgtcaaaacc cagagaggga aaaaaaaaac ggaaaaaagt
721 ttagctcaag taactgtaaa ctactgactc acagattcaa acaaattggc aaaagagggt
781 ttaattaatt tttgattccc cttttaattc acgttctagg gttttcgatt tttgattcgt
841 ctttgatcgg agcttagccg ccatggatac gaggtttccg ttctctccgg ccgaggtctc
901 taaagtccgg gtggtccagt ttggcatact cagccccgat gagatcgtaa tgctcctttc
961 tctttctctt tctctttcag gtccattcgt ccgccgtctg tttagctggc ggggatattg
1021 gcatgatagt agattggttt agttagggag catgttcttt tctatctatt gatcttaatt
1081 ggaagcttca agtaagaaat tgagtttcac tggttgctaa tggctttttt ttcagaggca
1141 aatgtctgtt atacatgttg agcatagtga gacgaccgag aagggtaaac ctaaggtggg
1201 aggattgagt gatacccgtc ttggtacgat tgatcggaag gtgaagtgtg agacatgtat
1261 ggctaatatg gctgagtgtc cgggacattt tggctatctt gagctcgcta agccaatgta
1321 tcatgtcggt tttatgaaga cagtgttaag tatcatgaga tgtgtctgtt tcaattgctc
1381 caagatttta gctgatgagg tatgtaggag cttatttcgt gtcagcattg cttgttttgc
1441 ttatgtcttt gtttgacgta gctgttcttc aattgattat ttttatgaag gaggagcata
1501 agtttaagca ggctatgaag atcaagaatc ctaagaatag gcttaagaag attctggatg
1561 cctgcaaaaa caagaccaaa tgtgatggtg gtgatgacat tgacgatgtc caaagccaca
1621 gcacggatga accagtaaaa aagagccgag gtggatgtgg tgcacaacaa ccaaaactga
1681 ctattgaggg tatgaagatg attgcagaat acaaaaattc aaaggaagaa aatgatgagc
1741 cagatcagct tcccgagcct gcagaaagga aacagacact tggtgctgat agggttcgtc
1801 ttttctttcg aatgaatttt acctcttgtt cttgtttggc gtaatctgat ggcatgtgat
1861 attcttgcag gttttgagtg ttttgaaaag gattagtgac gcggattgtc aactcctagg
1921 tttcaaccct aagtttgctc gtcctgactg gatgattctt gaagtccttc ctattcctcc
1981 accccctgtc agaccatctg taatgatgga cgccacttcc aggagtgagg taagtcgact
2041 acggttttct gaataaaact tttcttagac caacatagtg tgtgcccctg agtttatgct
2101 tattgttgat gcaggatgac ttgacccatc agctagctat gattattcga cacaatgaaa
2161 acttgaaaag gcaggaaaaa aatggagcgc cacgtcatat tatatcgaga tttacacaac
2221 tcttgcagtt tcatatagct acgtattttg ataacgagtt gcctggacag ccaagggtaa
2281 tgcttttgta tccttccaga tatctggtat gatacatacg gatattgctt tatctggatt
2341 ggcttttatt ttcttcaggc tactcagaaa tcagggaggc ctattaaatc aatatgtagt
2401 aggctgaagg caaaggaagg cagaatcagg ggtaacttga tgggaaaacg tgttgatttc
2461 tcggcacgta ctgttattac tccagatcca acaataaata ttgatgaact tggtgttccg
2521 tggagtattg ctctgaatct cacataccca gaaacagtta ctccctataa cattgaaagg
2581 ttagtacgtc tggtgtttat ttcattttct gagagtaatt tttccgtgtt ttactaacaa
2641 tttagtttgg cagattaaag gagcttgttg attatggacc acatcctcca cctgggaaga
2701 ctggagcgaa atatatcata agagatgatg gccaaagact agatcttcgg tatcttaaga
2761 agagcagtga tcaacatttg gaacttggat acaggtatgt actactttct tattctatcc
2821 actcaacgca caaaaggtta tttctggaag tcgtcatttt tatgctttct tggtcacagg
2881 tggagcggca tttacaggat ggtgattttg ttctgtttaa tcgtcaacca agtctgcaca
2941 aaatgtctat catgggtcac aggattagga ttatgccata ttccactttc cgtctgaatt
3001 tgtctgtcac gtctccgtac aatgctgatt ttgatgggga tgagatgaac atgcatgtac
3061 cacaatcatt cgagaccaga gccgaggtgt tagagctgat gatggttcct aaatgtattg
3121 tctcccccca ggcgaatcgt cctgtgatgg gaattgtgca ggataccctc ttggggtgcc
3181 gtaaaattac aaagagagat actttcatag agaaggtagc aatttcatca gcggttgctt
3241 tttaaagtta tctatgatat tgatgagagg ataaaccaac tcttatgctg atatcttatc
3301 cattttcagg atgtattcat gaacacactg atgtggtggg aagacttcga tgggaaagtt
3361 ccggctcctg caatcttgaa gcctcgtcct ctttggactg gcaaacaagt ttttaatctt
3421 atcataccaa aacagataaa tctgttgagg tactctgctt ggcacgcaga tacagagact
3481 ggatttataa ctccggggga tactcaagtg cgaattgaaa gaggggaact tcttgccgga
3541 actctttgca aaaagaccct tggtacatct aatggaagtc tcgtgcatgt catttggtaa
3601 gttgcacatc aagattttag agtgtttatt ccggtttgtc acatttgctt actggctaaa
3661 ttttggtacc ttagttaact tcacatcaca tgcagcattg ttgggaatat ctaaatgtag
3721 taattagttt catgagctct tttgttttaa ttcattttta cttggcttcc agggaagagg
3781 ttggtcctga tgcagctaga aaattcctcg gtcatactca atggcttgtc aattactggc
3841 ttctgcagaa tggttttacc atcggaattg gtgacacaat tgccgattca tcaacaatgg
3901 agaaaattaa tgaaactatt tccaatgcaa aaactgctgt gaaagatctt atccggcagt
3961 tccagggaaa ggaattggac cctgagcctg gccgaactat gagagataca tttgagaaca
4021 gggttaacca ggtatattca agacaataat tagttgattt tcgtttggtt gtcagtgcag
4081 tttttatatg tgctgactca taaactttcc ttgtaggttt tgaataaagc tcgtgatgat
4141 gctggaagta gtgctcaaaa gagtttagca gaaaccaata accttaaggc catggtgaca
4201 gcaggatcca aaggaagttt catcaatatt tctcaaatga cagcgtgtgt cggtcagcaa
4261 aatgttgaag ggaagcgaat tccatttgga tttgatgggc ggacattgcc acatttcacc
4321 aaagatgatt atggtcctga aagtcgtggt tttgttgaga attcgtacct gcgtggcttg
4381 actcctcaag agttcttttt ccatgctatg ggaggacgag aaggtcttat tgatactgct
4441 gtgaagacat cagaaactgg atacattcag aggcgattgg taaaggctat ggaggatatt
4501 atggttaagt atgatgggac agtcagaaac tctttgggtg atgttattca atttctctat
4561 ggagaagatg gtatggatgc tgtatggata gaatcacaga agctggattc cttgaaaatg
4621 aagaaatcag agtttgatag gacttttaag tatgagattg acgacgaaaa ctggaatcct
4681 acttacctaa gtgatgaaca tcttgaagac ttgaagggga ttcgggagtt gcgtgatgta
4741 ttcgatgcgg aatattcgaa acttgagact gacagattcc aactcgggac agaaattgca
4801 acaaatggtg atagcacttg gccattgcct gttaacatca agaggcatat ctggaatgcg
4861 cagaagactt tcaaaattga cttgcgcaaa atttcagata tgcaccctgt tgaaattgtt
4921 gatgctgttg ataaactaca ggagaggctg ttggttgttc ctggtgatga tgcgttgagt
4981 gtggaagcac agaaaaacgc aacattgttc tttaacattt tgcttcgcag cactcttgct
5041 agtaaaagag tgttggaaga atacaagctc agccgcgagg cttttgagtg ggtcattggt
5101 gagattgaat caaggttttt acaatcgcta gtggccccag gggaaatgat cggttgtgtt
5161 cctgctcaat caattggaga acctgctacg cagatgactc tgaatacctt ccattatgct
5221 ggtgtcagtg caaagaacgt tacgctcgga gttcccaggt tgcgtgaaat tattaatgta
5281 gctaagagga tcaaaacacc atccctatca gtctatctca ctccggaagc tagcaaatca
5341 aaagaggggg ctaagactgt tcagtgtgct ttggagtata ctactctcag gagtgttact
5401 caagctacgg aagtctggta tgacccagat ccaatgagta caataattga agaggacttt
5461 gaatttgtga ggtcctacta tgaaatgcca gatgaagatg tttccccaga taagatatct
5521 ccgtggctac ttcgtataga gttgaatcgc gagatgatgg ttgataagaa attgagtatg
5581 gcggatattg cggagaagat caaccttgag ttcgatgatg acctaacttg catattcaat
5641 gatgataatg ctcaaaaact gatccttcga attcgcatta tgaacgatga gggcccaaag
5701 ggagagttgc aagatgaatc ggctgaagat gatgttttcc tcaaaaagat tgagagcaac
5761 atgctgacag aaatggcact cagaggtatt ccagacatca acaaggtttt tataaaacag
5821 gttagaaaga gcaggtttga tgaggaggga ggcttcaaga catctgagga gtggatgttg
5881 gatacagaag gtgtgaacct cttagctgtc atgtgtcacg aagatgtgga tccaaagagg
5941 acaacaagca atcacttgat tgaaattatt gaagttctcg gaattgaggc agttcgtcgt
6001 gctttgcttg atgaactccg tgttgtgata tcctttgatg gttcttatgt gaattaccgt
6061 catcttgcca tcttgtgtga tactatgacc tatcgcggtc atctgatggc tatcactcga
6121 cacggtatca atagaaatga cactgggcct ctgatgagat gctcttttga agaaacagtt
6181 gatattctgc tagatgctgc ggcttatgct gagacagact gcttacgggg tgttactgag
6241 aatataatgt tgggtcaact tgcaccaatt gggacaggag attgtgagtt gtatctgaat
6301 gatgagatgc tgaagaatgc aattgaactt cagctcccta gctatatgga tggtcttgaa
6361 tttggaatga ctcctgctcg ttcaccagtg tcaggcactc cttaccatga aggcatgatg
6421 tctccaaact acctgttaag tccaaatatg cgtttatccc caatgtcaga tgcacagttt
6481 tctccatatg ttggtggaat ggccttttcg ccttcttctt ctccaggata tagtccatca
6541 tcgcctggat acagtcctac ttctcccggt tacagtccaa cttcgcctgg atatagcccg
6601 acttctcccg gttacagtcc aacttcgcct acctacagtc ccagttctcc tggctatagc
6661 ccaacaagcc ctgcttattc tcctacaagt ccttcctatt ctcctacctc tccgagctac
6721 agcccaacgt ctccaagcta tagcccaacg tcgccaagct acagcccgac atctccgagc
6781 tacagtccta cttccccaag ttacagcccg acttcgcctg cttacagccc gacttcacct
6841 gcttacagcc caacttcacc agcatacagc ccaacctctc cttcttacag cccaacttca
6901 ccttcttaca gcccaacatc gccttcttac agccctactt caccatctta cagcccaaca
6961 tctccgtctt acagccctac ttcacccgca tatagcccca catctcctgg ctacagccct
7021 acttcaccaa gttacagtcc aacatcacca agctacggtc ctacgtctcc aagctacaac
7081 cctcagtctg ctaaatatag cccatctata gcttactctc ctagcaatgc aagactatca
7141 ccagctagcc cctacagtcc tacatctccc aactacaggt aagtagtaat atcttaattt
7201 ttacactttc tatgaagctt ctcttatctg ttgtggtgat attgtgtcat cttctctaat
7261 cgttaactct tattaaaaat ttactctgca gcccgacatc tccatcatac tcacccacat
7321 ctccatctta ttcaccttca agtccaacat acagtcccag caggtttgac ttttacccgc
7381 ttgaaaatct tggatctgtt gtaatgcatc catctcaatt gaattccgca aaaagtttaa
7441 ctgggttgtt gttattaaat ctgtctcagc ccatacagct caggagcaag cccagactac
7501 agcccaagcg caggctactc gccaacactt cccggttatt caccgtcatc aacgggtcag
7561 tataccccac atgagggcga taaaaaggac aagactggaa aaaaagatgc cagtaaggat
7621 gataaaggca acccttgaaa gagagtgaga attggcaatc catgttttag gtaacgtcta
7681 aatctcttgg aaccattggg gtttacaagt cttactagct cacgaactta gccacttggg
7741 atatcgtgta acccgcaggt catgatttaa gagacagtga aagctgagaa aaggaaggga
7801 ccgtttcaaa gtgatcttct gtggataact ttgtgaacaa ggttttctta atagatcctt
7861 ttttcgtgag ttgtatatta ttccaaactg atccataaat ccatccaatc ctcatccccc
7921 aaaaagaaaa tagtataaac atagaaaaac gaaaacatat cagactttgg gcttttcgta
7981 tggtttagtt tttggctttt tgctgtgttg ttctatttca gaaagtaaac atgtgaaacg
8041 gttcatttgt
//