GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS       HSVHSV1PO    8400 bp ds-DNA             VRL       28-MAY-1991
DEFINITION  Herpes simplex virus type 1 (HSV-1) dbp/pol genes
ACCESSION   X03181
KEYWORDS    DNA binding protein; DNA polymerase; origin of replication;
            overlapping genes; unidentified reading frame.
SOURCE      Herpes simplex virus DNA.
  ORGANISM  Herpes simplex virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae;
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 8400)
  AUTHORS   Quinn,J.P. and McGeoch,D.J.
  TITLE     DNA sequence of the region in the genome of herpes simplex virus
            type 1 containing the genes for DNA polymerase and the major DNA
            binding protein
  JOURNAL   Nucleic Acids Res. 13, 8143-8163 (1985)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P04293; DPOL$HSV11.
            SWISS-PROT; P04296; DNBI$HSV11.
            From EMBL entry HEHSV1PO; dated 12-APR-1990.
FEATURES             Location/Qualifiers
     misc_feature    123..123
                     /note="put. mRNA 3' terminus (dbp gene)"
     misc_feature    complement(148..153)
                     /note="polyadenylation signal (dbp gene)"
     CDS             complement(205..3792)
                     /note="DNA binding protein (dbp gene) (aa 1-1196)"
                     /codon_start=3792
     misc_feature    4058..4058
                     /note="mRNA 5' terminus (dbp gene)"
     promoter        complement(4077..4082)
                     /note="pot. TATA box (dbp gene)"
     repeat_region   4124..4131
                     /note="direct repeat 1"
     misc_feature    4143..4286
                     /note="Ori L palindrome"
     repeat_unit     4143..4224
                     /note="inverted repeat A"
     repeat_unit     4225..4286
                     /note="inverted repeat A'"
     repeat_region   4272..4279
                     /note="direct repeat 1"
     promoter        4317..4321
                     /note="pot. TATA sequence (pol gene)"
     promoter        4345..4350
                     /note="pot. TATA sequence (pol gene)"
     CDS             4546..8250
                     /note="DNA polymerase (pol gene) (aa 1-1235)"
                     /codon_start=4546
     misc_feature    8092..8092
                     /note="put. 3' terminus (URF)"
     misc_feature    complement(8116..8121)
                     /note="put. polyadenylation signal (URF)"
     CDS             complement(8201..>8400)
                     /note="unidentified reading frame"
                     /codon_start=8400
     misc_feature    8287..8292
                     /note="polyadenylation signal (pol gene)"
     misc_feature    8316..8316
                     /note="put.
3' terminus (pol gene)" BASE COUNT 1440 a 2806 c 2746 g 1408 t ORIGIN 1 cggcctcccg cgggcccgcc gaccggcaag ccgggagtcg gcggcgcgtg cgtttctgct 61 ctattcccag acaccgcgga gaggaatcac ggcccgccca gagatataga cacggaacac 121 aaacaagcac ggatgtcgta gcaataattt attttacaca cattccccgc cccgccctag 181 gttcccccac cccccaaccc ctcacagcat atccaacgtc aggtctccct ttttgtcggg 241 gggcccctcc ccaaacgggt catccccgtg gaacgcccgt ttgcggccgg caaatgccgg 301 tcccggggcc cccgggccgc cgaacggcgt cgcgttgtcg tcctcgcagc caaaatcccc 361 aaagttaaac acctccccgg cgttgccgag ttggctgact agggcctcgg cctcgtgcgc 421 cacctccagg gccgcgtccg tcgaccactc gccgttgccg cgctccaggg cacgcgcggt 481 cagctccatc atctcctcgc ttaggtactc gtcctccagg agcgccagcc agtcctcgat 541 ctgcagctgc tgggtgcggg gccccaggct tttcacggtc gccacgaaca cgctactggc 601 gacggccgcc ccgccctcgg agataatgcc ccggagctgc tcgcacagcg agctttcgtg 661 cgctccgccg ccgaggcttg aggccgcgca cacaaacccg gcccggggac aggccaggac 721 gaacttgcgg gtgcggtcaa aaataaggag cgggcacgcg tttttgccgc ccatcaggct 781 ggcccagttc ccggcctgaa acacacggtc gttgccggcc atgccgtagt acttgctgat 841 gctcaacccc aacacgacca tggggcgcgc cgccatgacg ggccgcagca ggttgcagct 901 ggcgaacatg gacgtccacg cgcccggatg cgcgtccacg gcgtccatca gcgcgcgggc 961 cccggcctcc aggcccgccc cgccctgcgc ggaccacgcg gccgcagcct gcacgctggg 1021 gggacggcgg gaccccgcga tgatggccgt aagggtgttg atgaagtatg tcgagtgatc 1081 gcagtaccgc agaatctggt ttgccatgta gtacatcgcc agctcgctca cgttgttggg 1141 ggccaggtta ataaagttta tcgcgccgta gtccagggaa aactttttaa tgaacgcgat 1201 ggtctcgatg tcctcgcgcg acaggagccg ggcgggaagc tggttgcgtt ggagggccgt 1261 ccagaaccac tgcgggttcg gctggttgga ccccgggggc ttgccgttgg ggaagatggc 1321 cgcgtggaac tgcttcagca gaaagcccag cggtccgagg aggatgtcca cgcgcttgtc 1381 gggcttctgg taggcgctct ggaggctggc gacccgcgcc ttggcggcct cggacgcgtt 1441 ggcgctcgcg cccgcgaaca acacgcggct cttgacgcgc agctccttgg gaaaccccag 1501 ggtcacgcgg gcaacgtcgc cctcgaagct gctctcggcg ggggccgtct ggccggccgt 1561 taggctgggg gcgcagatag ccgccccctc cgagagcgcg accgtcagcg ttttggccga 1621 cagaaacccg ttgttaaaca 
tgtccatcac gcgccgccgc agcaccggtt ggaattgatt 1681 gcgaaagttg cgcccctcga ccgactgccc ggcgaacacc ccgtggcact gactcagggc 1741 caggtcctgg tacacggcga ggttggatcg ccgcccgaga agctgaagca gggggcacgg 1801 cccgcacgcg tacgggtcca gcgtcaggga catggcgtgg ttggcctcgc ccagaccgtc 1861 gcgaaacttg aagttcctcc cctccaccag gttgcgcatc agctgctcca cctcgcggtc 1921 cacgacctgc ctgacgttgt tcaccaccgt atgcagggcc tcgcggttgg tgatgatggt 1981 ctccagccgc cccatggccg tggggaccgc ctggtccacg tactgcaggg tctcgagttc 2041 ggccatgacg cgctcggtcg ccgcgcggta cgtctcctgc atgatggtcc gggcggtctc 2101 ggatccgtcc gcgcgcttca gggccgagaa ggcggcgtag tttcccagca cgtcgcagtc 2161 gctgtacatg ctgttcatgg tcccgaagac gccgatggct ccgcgggcgg cgctggcgaa 2221 ctttggatgg cgcgcccgga ggcgcatgag cgtcgtgtgt acgcaggcgt ggcgcgtgtc 2281 gaaggtgcat aggttacagg gcacgtcggt ctggttggag tccgcgacgt atcgaaacac 2341 gtccatctcc tggcgcccga cgatcacggc gccgtcgcag cgctccaggt aaaacagcat 2401 cttggccagc agcgccgggg aaaacccaca cagcatggcc aggtgctcgc cggcaaattc 2461 ctgggttccg ccgacgaggg gcgcggtggg ccgaccctcg aacccgggca ccacgtgtcc 2521 ctcgcggtcc acctgtgggt tggccgccac gtgggtcccg ggcacgagga agaagcggta 2581 aaaggagggt ttgctgtggt cctttgggtc cgccgggccg gcgtcgtcca cctcggtgag 2641 atggagggcc gagttggtgc taaataccat ggcccccacg agtcccgcgg cgcgcgccag 2701 gtacgccccg acggcgttgg cgcgggccgc ggccgtgtcc tggccctcga acagcggcca 2761 cgcggagatg tcggtgggcg gctcgtcaaa gacggccatc gacacgatag actcgagggc 2821 cagggcggcg tctccggcca tgacggaggc caggcgctgt tcgaacccgc ccgccgcgcc 2881 cttgccgccg ccgtcgcgcc cgccccgcgg ggtcttaccc tggctggctt cgaaggccgt 2941 gaacgtaatg tcggcgggga gggcggcgcc ctcgtggttt tcgtcaaacg ccaggtgggc 3001 ggccgcgcgg gccacggcgt ccacgtttcg gcatcgcagt gccacggcgg cgggtcccac 3061 gaccgcctcg aacaggaggc ggttgagggg gcggttaaaa aacggaagcg ggtaggtaaa 3121 tttctccccg atcgatcggt ggttggcgtt gaacggctct gcgatgacac ggctaaaatc 3181 cggcatgaac agctgcaacg ggtacacggg tatgcggtgc acctccgccc cgcctatggt 3241 taccttgtcc gagcctccca ggtgcagaaa ggtgttgttg atgcacacgg cctccttgaa 3301 gccctcggta acgaccagat acaggagggc 
gcggtccggg tccaggccga ggcgctcaca 3361 cagcgcctcc cccgtcgtct cgtgtttgag gtcgccgggc cggggggtgt agtccgaaaa 3421 gccaaaatgg cggcgtgccc gctcgcaaag tcgcgtcagg ttcggggcct gggtgctggg 3481 gtccaggtgc cggccgccgt gaaagacgta cacggacgag ctgtagtgcg agggcgtcag 3541 tttcagggac accgcggtac ccccgagccc cgtcgtgcga gaacccacga ccacggccac 3601 gttggcctca aagccgctct ccacggtcag gcccacgacc aggggcgcca cggcgacgtc 3661 ggaatcgccg ctgcgtgccg acagtaacgc cagaagctcg atgccttcgg acggacacgc 3721 gcgagcgtac acgtatccca ggggcccggg ggggaccttg atggtggttg ccgtcttggg 3781 ctttgtctcc atgtcctttt gtcaatcggt ccgcgaacgg aggtaatccc ggcacgacga 3841 cggacgcccg acaaggtatg tctcccgagc gtcaaaatcc gggggggggc ggcgacggtc 3901 aaggggaggg ttggagaccg gggttgggga atgaatccct ccccttcacc gacaaccccc 3961 cgggtaacca cggggtcgcc gatgaacccc ggcggccggc aacgcggggt ccctgcgaga 4021 ggcacagatg cttacggtca ggtgctccgg gtcgggtgcg tctggtatgc ggttggtata 4081 tgtacacttt acctgggggc gtgccggtcc gccccagccc ctcccacgcc ccgcgcgtca 4141 tcagccggtg ggcgtggccg ctattataaa aaaagtgaga acgcgaagcg ttcgcacttt 4201 gtcctaataa tatatatatt attaggacaa agtgcgaacg cttcgcgttc tcactttttt 4261 tataatagcg gccacgccca ccggctacgt cactctcctg tcggccgccg gcggtccata 4321 agcccggccg gccgggccga cgcgaataaa ccgggccgcc ggccggggcg ccgcgcagca 4381 gctcgccgcc cggatccgcc agacaaacaa ggcccttgca catgccggcc cgggcgagcc 4441 tgggggtccg gtaattttgc catcccaccc aagcggcttt ttgggttttt ctcttccccc 4501 ctccccacat tcccctcttt aggggttcgg gtgggaacaa ccgcgatgtt ttccggtggc 4561 ggcggcccgc tgtcccccgg aggaaagtcg gcggccaggg cggcgtccgg gttttttgcg 4621 cccgccggcc ctcgcggagc cagccgggga cccccgcctt gtttgaggca aaacttttac 4681 aacccctacc tcgccccagt cgggacgcaa cagaagccga ccgggccaac ccagcgccat 4741 acgtactata gcgaatgcga tgaatttcga ttcatcgccc cgcgggtgct ggacgaggat 4801 gcccccccgg agaagcgcgc cggggtgcac gacggtcacc tcaagcgcgc ccccaaggtg 4861 tactgcgggg gggacgagcg cgacgctcct ccgcgtcggg tcgggcggct tctggccgcg 4921 gcgtcgcgcc tgtggggcgg cgtggaccac gccccggcgg ggttcaaccc caccgtcacc 4981 gtctttcacg tgtacgacat cctggagaac gtggagcacg 
cgtacggcat gcgcgcggcc 5041 cagttccacg cgcggtttat ggacgccatc acaccgacgg ggaccgtcat cacgctcctg 5101 ggcctgactc cggaaggcca ccgggtggcc gttcacgttt acggcacgcg gcagtacttt 5161 tacatgaaca aggaggaggt cgacaggcac ctacaatgcc gcgccccacg agatctctgc 5221 gagcgcatgg ccgcggccct gcgcgagtcc ccgggcgcgt cgttccgcgg catctccgcg 5281 gaccacttcg aggcggaggt ggtggagcgc accgacgtgt actactacga gacgcgcccc 5341 gctctgtttt accgcgtcta cgtccgaagc gggcgtgtgc tgtcgtacct gtgcgacaac 5401 ttctgcccgg ccatcaagaa gtacgagggt ggggtcgacg ccaccacccg gttcatcctg 5461 gacaaccccg ggttcgtcac cttcggctgg taccgtctca aaccgggccg gaacaacacg 5521 ctagcccagc cggcggcccc gatggccttc gggacatcca gcgacgtcga gtttaactgt 5581 acggcggaca acctggccat cgaggggggc atgagcgacc taccggcata caagctcatg 5641 tgcttcgata tcgaatgcaa ggcggggggg gaggacgagc tggcctttcc ggtggccggg 5701 cacccggagg acctggtcat ccagatatcc tgtctgctct acgacctgtc caccaccgcc 5761 ctggagcacg tcctcctgtt ttcgctcggt tcctgcgacc tccccgaatc ccacctgaac 5821 gagctggcgg ccaggggcct gcccacgccc gtggttctgg aattcgacag cgaattcgag 5881 atgctgttgg ccttcatgac ccttgtgaaa cagtacggcc ccgagttcgt gaccgggtac 5941 aacatcatca acttcgactg gcccttcttg ctggccaagc tgacggacat ttacaaggtc 6001 cccctggacg ggtacggccg catgaacggc cggggcgtgt ttcgcgtgtg ggacataggc 6061 cagagccact tccagaagcg cagcaagata aaggtgaacg gcatggtgaa catcgacatg 6121 tacgggatta taaccgacaa gatcaagctc tcgagctaca agctcaacgc cgtggccgaa 6181 gccgtcctga aggacaagaa gaaggacctg agctatcgcg acatccccgc ctactacgcc 6241 gccgggcccg cgcaacgcgg ggtgatcggc gagtactgca tacaggattc cctgctggtg 6301 ggccagctgt tttttaagtt tttgccccat ctggagctct cggccgtcgc gcgcttggcg 6361 ggtattaaca tcacccgcac catctacgac ggccagcaga tccgcgtctt tacgtgcctg 6421 ctgcgcctgg ccgaccagaa gggctttatt ctgccggaca cccaggggcg atttaggggc 6481 gccggggggg aggcgcccaa gcgtccggcc gcagcccggg aggacgagga gcggccagag 6541 gaggaggggg aggacgagga cgaacgcgag gagggcgggg gcgagcggga gccggagggc 6601 gcgcgggaga ccgccggcag gcacgtgggg taccaggggg ccagggtcct tgaccccact 6661 tccgggtttc acgtgaaccc cgtggtggtg ttcgactttg ccagcctgta 
ccccagcatc 6721 atccaggccc acaacctgtg cttcagcacg ctctccctga gggccgacgc agtggcgcac 6781 ctggaggcgg gcaaggacta cctggagatc gaggtggggg ggcgacggct gttcttcgtc 6841 aaggctcacg tgcgagagag cctcctcagc atcctcctgc gggactggct cgccatgcga 6901 aagcagatcc gctcgcggat tccccagagc agccccgagg aggccgtgct cctggacaag 6961 cagcaggccg ccatcaaggt cgtgtgtaac tcggtgtacg ggttcacggg agtgcagcac 7021 ggactcctgc cgtgcctgca cgttgccgcg acggtgacga ccatcggccg cgagatgctg 7081 ctcgcgaccc gcgagtacgt ccacgcgcgc tgggcggcct tcgaacagct cctggccgat 7141 ttcccggagg cggccgacat gcgcgccccc gggccctatt ccatgcgcat catctacggg 7201 gacacggact ccatctttgt gctgtgccgc ggcctcacgg ccgccgggct gacggccgtg 7261 ggcgacaaga tggcgagcca catctcgcgc gcgctgtttc tgccccccat caaactcgag 7321 tgcgaaaaga cgttcaccaa gctgctgctg atcgccaaga aaaagtacat cggcgtcatc 7381 tacgggggta agatgctcat caagggcgtg gatctggtgc gcaaaaacaa ctgcgcgttt 7441 atcaaccgca cctccagggc cctggtcgac ctgctgtttt acgacgatac cgtctccgga 7501 gcggccgccg cgttagccga gcgccccgcg gaggagtggc tggcgcgacc cctgcccgag 7561 ggactgcagg cgttcggggc cgtcctcgta gacgcccatc ggcgcatcac cgacccggag 7621 agggacatcc aggactttgt cctcaccgcc gaactgagca gacacccgcg cgcgtacacc 7681 aacaagcgcc tggcccacct gacggtgtat tacaagctca tggcccgccg cgcgcaggtc 7741 ccgtccatca aggaccggat cccgtacgtg atcgtggccc agacccgcga ggtagaggag 7801 acggtcgcgc ggctggccgc cctccgcgag ctagacgccg ccgccccagg ggacgagccc 7861 gccccccccg cggccctgcc ctccccggcc aagcgccccc gggagacgcc gtcgcctgcc 7921 gaccccccgg gaggcgcgtc caagccccgc aagctgctgg tgtccgagct ggccgaggat 7981 cccgcatacg ccattgccca cggcgtcgcc ctgaacacgg actattactt ctcccacctg 8041 ttgggggcgg cgtgcgtgac attcaaggcc ctgtttggga ataacgccaa gatcaccgag 8101 agtctgttaa aaaggtttat tcccgaagtg tggcaccccc cggacgacgt ggccgcgcgg 8161 ctccggaccg cagggttcgg ggcggtgggt gccggcgcta cggcggagga aactcgtcga 8221 atgttgcata gagcctttga tactctagca tgagcccccc gtcgaagctg atgtccctca 8281 ttttacaata aatgtctgcg gccgacacgg tcggaatctc cgcgtccgtg ggtttctctg 8341 cgttgcgccg gaccacgagc acaaacgtgc tctgccacac gtgggcgacg aaccggtacc 
//
GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS       HS1HSV1SU   12979 bp ds-DNA             VRL       28-MAY-1991
DEFINITION  Herpes simplex virus type 1 (HSV1) short unique region DNA
ACCESSION   X02138
KEYWORDS    glycoprotein; unidentified reading frame.
SOURCE      Herpes simplex virus type 1 DNA.
  ORGANISM  Herpes simplex virus type 1
            Viridae; ds-DNA enveloped viruses; Herpesviridae;
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 12979)
  AUTHORS   McGeoch,D.J., Dolan,A., Donald,S. and Rixon,F.J.
  TITLE     Sequence determination and genetic content of the short unique
            region in the genome of herpes simplex virus type 1
  JOURNAL   J. Mol. Biol. 181, 1-13 (1985)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P03170; IE12$HSV11.
            SWISS-PROT; P03171; VGLD$HSV11.
            SWISS-PROT; P04413; KR1$HSV11.
            SWISS-PROT; P04485; IE68$HSV11.
            SWISS-PROT; P04487; DNB$HSV11.
            SWISS-PROT; P04488; VGLE$HSV11.
            SWISS-PROT; P06480; US05$HSV11.
            SWISS-PROT; P06481; US09$HSV11.
            SWISS-PROT; P06484; VGLG$HSV11.
            SWISS-PROT; P06485; US02$HSV11.
            SWISS-PROT; P06486; US10$HSV11.
            SWISS-PROT; P06487; VGLI$HSV11.
            From EMBL entry HEHSV1SU; dated 19-DEC-1990.
FEATURES             Location/Qualifiers
     precursor_RNA   <1..1356
                     /note="primary transcript Us1"
     misc_feature    1..12979
                     /note="HSV1 unique sequence Us"
     CDS             40..1299
                     /product="Umw 68 (Us1)"
                     /codon_start=40
     misc_feature    1337..1342
                     /note="polyadenylation signal (Us1)"
     precursor_RNA   complement(1419..2700)
                     /note="primary transcript Us2"
     misc_feature    complement(1432..1437)
                     /note="polyadenylation signal (Us2)"
     CDS             complement(1452..2324)
                     /note="32K (Us2)"
                     /codon_start=2324
     CDS             complement(2286..2321)
                     /note="pot. signal peptide for membrane-bound translation
                     (Us2) (aa 2-13)"
                     /codon_start=2321
     promoter        2330..2336
                     /note="TATA-box like sequence (Us3a)"
     precursor_RNA   2360..4927
                     /note="primary transcript Us3a"
     promoter        2559..2564
                     /note="TATA-box like sequence (Us3b)"
     precursor_RNA   2585..4927
                     /note="primary transcript Us3b"
     CDS             2618..4060
                     /note="53K (Us3) (aa 1-481)"
                     /codon_start=2618
     promoter        complement(2723..2729)
                     /note="TATA-box like sequence (Us2)"
     promoter        4098..4105
                     /note="TATA-box like sequence (Us4)"
     precursor_RNA   4125..4927
                     /note="primary transcript (Us4)"
     CDS             4140..4853
                     /note="25K (Us4) (aa 1-238)"
                     /codon_start=4140
     misc_feature    4155..4157
                     /note="pot. alternative start codon for 25K"
     CDS             4161..4214
                     /note="pot. signal peptide for membrane-bound translation
                     (Us4) (aa 8-22)"
                     /codon_start=4161
     CDS             4707..4772
                     /note="put. transmembrane region (Us4) (aa 190-211)"
                     /codon_start=4707
     misc_feature    4904..4909
                     /note="polyadenylation signal (Us3)"
     misc_feature    4904..4909
                     /note="polyadenylation signal (Us4)"
     promoter        4991..4997
                     /note="TATA-box like sequence (Us5)"
     precursor_RNA   5026..8429
                     /note="pot. primary transcript (Us5)"
     CDS             5127..5402
                     /note="9K (Us5) (aa 1-92)"
                     /codon_start=5127
     CDS             5151..5195
                     /note="pot. signal peptide for membrane-bound translation
                     (Us5) (aa 9-23)"
                     /codon_start=5151
     CDS             5256..5336
                     /note="pot. transmembrane sequence (Us5) (aa 44-92)"
                     /codon_start=5256
     misc_feature    5409..5423
                     /note="variable C tract"
     promoter        5705..5712
                     /note="TATA-box like sequence (Us6)"
     misc_feature    5729..5738
                     /note="multiple mRNA 5' site (Us6)"
     CDS             5815..6996
                     /product="glycoprotein gD (Us6)"
                     /codon_start=5815
     CDS             5836..5874
                     /note="pot. signal peptide for membrane-bound translation
                     (Us6) (aa 8-20)"
                     /codon_start=5836
     CDS             6811..6906
                     /note="pot. transmembrane sequence (Us6) (aa 333-364)"
                     /codon_start=6811
     promoter        7062..7069
                     /note="TATA-box like sequence (Us7)"
     precursor_RNA   7090..8429
                     /note="pot. primary transcript Us7"
     CDS             7181..8350
                     /note="41K (Us7) (aa 1-390)"
                     /codon_start=7181
     CDS             7193..7249
                     /note="pot. signal peptide for membrane-bound translation
                     (Us7) (aa 5-23)"
                     /codon_start=7193
     repeat_region   7800..7810
                     /note="direct repeat x1"
     repeat_region   7811..7820
                     /note="direct repeat y1"
     repeat_region   7821..7831
                     /note="direct repeat x2"
     repeat_region   7832..7841
                     /note="direct repeat y2"
     repeat_region   7842..7852
                     /note="direct repeat z1"
     repeat_region   7853..7862
                     /note="direct repeat y3"
     repeat_region   7863..7873
                     /note="direct repeat z2"
     CDS             7978..8068
                     /note="pot. transmembrane sequence (Us7) (aa 267-296)"
                     /codon_start=7978
     CDS             8069..8069
                     /note="pot. anchor sequence (Us7) (aa 297-308)"
                     /codon_start=8069
     misc_feature    8409..8414
                     /note="polyadenylation signal (Us5)"
     misc_feature    8409..8414
                     /note="polyadenylation signal (Us6)"
     misc_feature    8409..8414
                     /note="polyadenylation signal (Us7)"
     misc_feature    8429..8429
                     /note="mRNA 3' site (Us6)"
     promoter        8535..8541
                     /note="TATA-box like sequence (Us8)"
     precursor_RNA   8567..11088
                     /note="primary transcript (Us8)"
     CDS             8639..10288
                     /product="glycoprotein gE (Us8)"
                     /codon_start=8639
     CDS             8648..8707
                     /note="pot. signal peptide for membrane-bound translation
                     (Us8) (aa 4-23)"
                     /codon_start=8648
     CDS             9896..9970
                     /note="pot. transmembrane sequence (Us8) (aa 420-444)"
                     /codon_start=9896
     promoter        10614..10619
                     /note="TATA-box like sequence (Us9)"
     precursor_RNA   10641..11088
                     /note="primary transcript (Us9)"
     CDS             10708..10977
                     /note="10K (Us9) (aa 1-90)"
                     /codon_start=10708
     misc_feature    11063..11068
                     /note="polyadenylation signal (Us8 Us9)"
     repeat_region   11107..11121
                     /note="direct repeat 1"
     repeat_region   11122..11136
                     /note="direct repeat 2"
     repeat_region   11137..11151
                     /note="direct repeat 3"
     repeat_region   11152..11166
                     /note="direct repeat 4"
     repeat_region   11167..11181
                     /note="direct repeat 5"
     repeat_region   11182..11196
                     /note="direct repeat 6"
     repeat_region   11197..11211
                     /note="direct repeat 7"
     repeat_region   11212..11226
                     /note="direct repeat 8"
     repeat_region   11227..11241
                     /note="direct repeat 9"
     repeat_region   11242..11256
                     /note="direct repeat 10"
     repeat_region   11257..11259
                     /note="imperfect direct repeat"
     precursor_RNA   complement(11514..12561)
                     /note="primary transcript (Us10)"
     precursor_RNA   complement(11514..12855)
                     /note="primary transcript (Us11)"
     precursor_RNA   complement(11514..>12979)
                     /note="primary transcript (Us12)"
     misc_feature    11530..11535
                     /note="polyadenylation signal (Us10)"
     misc_feature    11530..11535
                     /note="polyadenylation signal (Us11)"
     misc_feature    11530..11535
                     /note="polyadenylation signal (Us12)"
     CDS             complement(11556..12490)
                     /note="34K (Us10) (aa 1-284)"
                     /codon_start=12490
     CDS             complement(12159..12641)
                     /note="18K (Us11) (aa 1-134)"
                     /codon_start=12641
     repeat_region   12204..12220
                     /note="direct repeat 1"
     repeat_region   12223..12238
                     /note="direct repeat 2"
     repeat_region   12241..12256
                     /note="direct repeat 3"
     repeat_region   12259..12263
                     /note="imperfect direct repeat"
     promoter        complement(12582..12588)
                     /note="TATA-box like sequence (Us10)"
     CDS             complement(12709..12972)
                     /note="Umw12 (Us12) (aa 1-85)"
                     /codon_start=12972
     promoter        complement(12879..12886)
                     /note="TATA-box like sequence (Us11)"
BASE COUNT     2286 a   4271 c   4078 g   2344 t
ORIGIN
1 cggggggaag ccactgtggt cctccgggac gttttctgga tggccgacat ttccccaggc 61 gcttttgcgc cttgtgtaaa agcgcggcgt cccgctctcc 
gatccccgcc cctgggcacg 121 cgcaagcgca agcgcccttc ccgccccctc tcatcggagt ctgaggtaga atccgataca 181 gccttggagt ctgaggtcga atccgagaca gcatcggatt cgaccgagtc tggggaccag 241 gatgaagccc cccgcatcgg tggccgtagg gccccccgga ggcttggggg gcggtttttt 301 ctggacatgt cggcggaatc caccacgggg acggaaacgg atgcgtcggt gtcggacgac 361 cccgacgaca cgtccgactg gtcttatgac gacattcccc cacgacccaa gcgggcccgg 421 gtaaacctgc ggctcacgag ctctcccgat cggcgggatg gggttatttt tcctaagatg 481 gggcgggtcc ggtctacccg ggaaacgcag ccccgggccc ccaccccgtc ggccccaagc 541 ccaaatgcaa tgctacggcg ctcggtgcgc caggcccaga ggcggagcag cgcacgatgg 601 acccccgacc tgggctacat gcgccagtgt atcaatcagc tgtttcgggt cctgcgggtc 661 gcccgggacc cccacggcag tgccaaccgc ctgcgccacc tgatacgcga ctgttacctg 721 atgggatact gccgagcccg tctggccccg cgcacgtggt gccgtttgct gcaggtgtcc 781 ggcggaacct ggggcatgca cctgcgcaac accatacggg aggtggaggc tcgattcgac 841 gccaccgcgg aacccgtgtg caagcttcct tgtttggaga ccagacggta cggcccggag 901 tgtgatctta gtaatctcga gattcatctc agcgcgacaa gcgatgatga aatctccgat 961 gccaccgatc tggaggccgc cggttcggac cacacgctcg cgtcccagtc cgacacggag 1021 gatgccccct cccccgttac gctggaaacc ccagaacccc gcgggtccct cgctgtgcgt 1081 ctggaggatg agtttgggga gtttgactgg accccccagg agggctccca gccctggctg 1141 tctgcggtcg tggccgatac cagctccgtg gaacgcccgg gcccatccga ttctggggcg 1201 ggtcgcgccg cagaagaccg caagtgtctg gacggctgcc ggaaaatgcg cttctccacc 1261 gcctgcccct atccgtgcag cgacacgttt ctccggccgt gagtccggtc gccccgaccc 1321 ccttgtatgt ccccaaaata aaagaccaaa atcaaagcgt ttgtcccagc gtcttaatgg 1381 cgggaagggc ggagagaaac agaccacgcg gacatggggg gtgtttgggg gtttattggc 1441 accgggggct aaagggtggt aaccggatag cagatgtgag gaagtcgggg ccgttcgccg 1501 cgaacggcga tcagagggtc agtttcttgc ggaccacggc ccggcgatgt gggttgctcg 1561 tctgggacct cgggcatgcc catacacgca caacacggac gccgcaccgg atgggacgtc 1621 gtaagggggc ctggggtagc tgggtggggt ttgtgcagag caatcaggga ccgcagccag 1681 cgcatacaat cgcgctcccg tccgtttgtc ccgggcagta ccacgccgta ctggtattcg 1741 taccggctga gcagggtctc cagggggtgg ttgggggccg cggggaacgg ggtccacgcc 1801 
acggtccact cgggcaaaaa ccgagtcggc acggcccacg gttctcccac ccacgcgtct 1861 ggggtcttga tggcgataaa tcttaccccg agccggattt tttgggcgta ttcgagaaac 1921 ggcacacaca gatccgccgc gcctaccacc cacaagtggt agaggcgagg ggggctgggt 1981 tggtctcggt gcagcagtcg gaagcacgcc acggcgtcca cgacctcggt gctctccaag 2041 gggctgtcct ccgcaaacag gcccgtggtg gtgtttgggg ggcagcgaca ggacctagtg 2101 cgcacgatcg ggcgggtggg tttgggtaag tccatcagcg gctcggccaa ccgtcgaagg 2161 ttggccggac gaacgacgac cggggtaccc aggggttctg atgccaaaat gcggcactgc 2221 ctaagcagga agctccacag ggccgggctt gcgtcgacgg aagtccgggg cagggcgttg 2281 ttctggtcaa ggagggtcat tacgttgacg acaacaacgc ccatgttggt atattacagg 2341 cccgtgtccg atttggggca cttgcagatt tgtaaggcca cgcacggcgg ggagacaggc 2401 cgacgcgggg gctgctctaa aaatttaagg gccctacggt ccacagaccc gccttcccgg 2461 gggggccctt ggagcgaccg gcagcggagg cgtccggggg aggggagggt gatttacggg 2521 ggggtaggtc agggggtggg tcgtcaaact gccgctcctt aaaaccccgg ggcccgtcgt 2581 tcggggtgct cgttggttgg cactcacggt gcggcgaatg gcctgtcgta agttttgtcg 2641 cgtttacggg ggacagggca ggaggaagga ggaggccgtc ccgccggaga caaagccgtc 2701 ccgggtgttt cctcatggcc ccttttatac cccagccgag gacgcgtgcc tggactcccc 2761 gcccccggag acccccaaac cttcccacac cacaccaccc agcgaggccg agcgcctgtg 2821 tcatctgcag gagatccttg cccagatgta cggaaaccag gactacccca tagaggacga 2881 ccccagcgcg gatgccgcgg acgatgtcga cgaggacgcc ccggacgacg tggcctatcc 2941 ggaggaatac gcagaggagc tttttctgcc cggggacgcg accggtcccc ttatcggggc 3001 caacgaccac atccctcccc cgtgtggcgc atctcccccc ggtatacgac gacgcagccg 3061 ggatgagatt ggggccacgg gatttaccgc ggaagagctg gacgccatgg acagggaggc 3121 ggctcgagcc atcagccgcg gcggcaagcc cccctcgacc atggccaagc tggtgactgg 3181 catgggcttt acgatccacg gagcgctcac cccaggatcg gaggggtgtg tctttgacag 3241 cagccatcca gattaccccc aacgggtaat cgtgaaggcg gggtggtaca cgagcacgag 3301 ccacgaggcg cgactgctga ggcgactgga ccacccggcg atcctgcccc tcctggacct 3361 gcatgtcgtc tccggggtca cgtgtctggt cctccccaag taccaggccg acctgtatac 3421 ctatctgagt aggcgcctga acccactggg acgcccgcag atcgcagcgg tctcccggca 3481 gctcctaagc 
gccgttgact acattcaccg ccagggcatt atccaccgcg acattaagac 3541 cgaaaatatt tttattaaca cccccgagga catttgcctg ggggactttg gcgccgcgtg 3601 cttcgtgcag ggttcccgat caagcccctt cccctacgga atcgccggaa ccatcgacac 3661 caacgccccc gaggtcctgg ccggggatcc gtataccacg accgtcgaca tttggagcgc 3721 cggtctggtg atcttcgaga ctgccgtcca caacgcgtcc ttgttctcgg ccccccgcgg 3781 ccccaaaagg ggcccgtgcg acagtcagat cacccgcatc atccgacagg cccaggtcca 3841 cgttgacgag ttttccccgc atccagaatc gcgcctcacc tcgcgctacc gctcccgcgc 3901 ggccgggaac aatcgcccgc cgtacacccg accggcctgg acccgctact acaagatgga 3961 catagacgtc gaatatctgg tttgcaaagc cctcaccttc gacggcgcgc ttcgccccag 4021 cgccgcagag ctgctttgtt tgccgctgtt tcaacagaaa tgaccgcccc ctgggggcgg 4081 tgctgtttgc gggttggcac aaaaagaccc cgatccgcgt ctgtggtgtt tttggcatca 4141 tgtcgcaggg cgccatgcgt gccgttgttc ccattatccc attccttttg gttcttgtcg 4201 gtgtatcggg ggttcccacc aacgtctcct ccaccaccca accccaactc cagaccaccg 4261 gtcgtccctc gcatgaagcc cccaacatga cccagaccgg caccaccgac tctcccaccg 4321 ccatcagcct taccacgccc gaccacacac cccccatgcc aagtattgga ctggaggagg 4381 aggaagagga ggagggggcc ggggacggcg aacatcttga ggggggagat gggacccgtg 4441 acaccctacc ccagtccccg ggcccagcct tcccgttggc tgaggacgtc gagaaggaca 4501 aacccaaccg tcccgtagtc ccatcccccg atcccaacaa ctcccccgcg cgccccgaga 4561 ccagtcgccc gaagacaccc cccaccatta tcgggccgct ggcaactcgc cccacgaccc 4621 gactcacctc aaagggacga cccttggttc cgacgcctca acataccccg ctgttctcgt 4681 tcctcactgc ctcccccgcc ctggacaccc tcttcgtcgt cagcaccgtc atccacacct 4741 tatcgttttt gtgtattggt gcgatggcga cacacctgtg tggcggttgg tccagacgcg 4801 ggcgacgcac acaccctagc gtgcgttacg tgtgcctgcc gtccgaacgc gggtagggta 4861 tggggcgggg gatggggaga gcccacatgc ggaaagcaag aacaataaag gcggtggtat 4921 ctagttgata tgcatctctg ggtgtttttg gggtgtggcg gacgcggggc ggtcattgga 4981 cggggtgcag ttaaatacat gcccgggacc catgaagcat gcgcgacttc cgggcctcag 5041 aacccacccg aaacggccaa cggacgtctg agccaggcct ggctatccgg agaaacagca 5101 cacgacttgg cgttctgtgt gtcgcgatgt ctctgcgcgc agtctggcat ctggggcttt 5161 tgggaagcct cgtgggggct 
gttcttgccg ccacccatcg gggacctgcg gccaacacaa 5221 cggacccctt aacgcacgcc ccagtgtccc ctcaccccag ccccctgggg ggctttgccg 5281 tccccctcgt agtcggtggg ctgtgcgccg tagtcctggg ggcggcatgt ctgcttgagc 5341 tcctgcgtcg tacgtgccgc gggtgggggc gttaccatcc ctacatggac ccagttgtcg 5401 tataatttcc cccccccccc cccttctccg cgtgggtgat gtcgggtcca aactcccgac 5461 accaccagct ggcatggtat aaatcaccgg tgcgcccccc aaaccatgtc cggcaggggg 5521 atgggggggc aatgcggagg gcacccaaca acaccgggct aaccaggaaa tccgtggccc 5581 cggcccccaa taaagatcgc ggtagcccgg ccgtgtgaca ctatcgtcca taccgaccac 5641 accgacgaat cccccaaggg ggaggggcca ttttacgagg aggaggggta taacaaagtc 5701 tgtctttaaa aagcaggggt tagggagttg ttcggtcata agcttcagcg cgaacgacca 5761 actaccccga tcatcagtta tccttaaggt ctcttttgtg tggtgcgttc cggtatgggg 5821 ggggctgccg ccaggttggg ggccgtgatt ttgtttgtcg tcatagtggg cctccatggg 5881 gtccgcagca aatatgcctt ggtggatgcc tctctcaaga tggccgaccc caatcgcttt 5941 cgcggcaaag accttccggt cctggaccag ctgaccgacc ctccgggggt ccggcgcgtg 6001 taccacatcc aggcgggcct accggacccg ttccagcccc ccagcctccc gatcacggtt 6061 tactacgccg tgttggagcg cgcctgccgc agcgtgctcc taaacgcacc gtcggaggcc 6121 ccccagattg tccgcggggc ctccgaagac gtccggaaac aaccctacaa cctgaccatc 6181 gcttggtttc ggatgggagg caactgtgct atccccatca cggtcatgga gtacaccgaa 6241 tgctcctaca acaagtctct gggggcctgt cccatccgaa cgcagccccg ctggaactac 6301 tatgacagct tcagcgccgt cagcgaggat aacctggggt tcctgatgca cgcccccgcg 6361 tttgagaccg ccggcacgta cctgcggctc gtgaagataa acgactggac ggagattaca 6421 cagtttatcc tggagcaccg agccaagggc tcctgtaagt acgccctccc gctgcgcatc 6481 cccccgtcag cctgcctctc cccccaggcc taccagcagg gggtgacggt ggacagcatc 6541 gggatgctgc cccgcttcat ccccgagaac cagcgcaccg tcgccgtata cagcttgaag 6601 atcgccgggt ggcacgggcc caaggcccca tacacgagca ccctgctgcc cccggagctg 6661 tccgagaccc ccaacgccac gcagccagaa ctcgccccgg aagaccccga ggattcggcc 6721 ctcttggagg accccgtggg gacggtggcg ccgcaaatcc caccaaactg gcacataccg 6781 tcgatccagg acgccgcgac gccttaccat cccccggcca ccccgaacaa catgggcctg 6841 atcgccggcg cggtgggcgg cagtctcctg 
gcagccctgg tcatttgcgg aattgtgtac 6901 tggatgcgcc gccacactca aaaagcccca aagcgcatac gcctccccca catccgggaa 6961 gacgaccagc cgtcctcgca ccagcccttg ttttactaga taccccccct taatgggtgc 7021 gggggggtca ggtctgcggg gttgggatgg gaccttaact ccatataaag cgagtctgga 7081 aggggggaaa ggtggacagt cgataagtcg gtagcggggg acgcgcacct gttccgcctg 7141 tcgcacccac agcttttttt gcgaaccgtc ccgttccggg atgccgtgcc gcccgttgca 7201 gggcctggtg ctcgtgggcc tctgggtctg tgccaccagc ctggttgtcc gtggccccac 7261 ggtcagtctg gtatcaaact catttgtgga cgccggggcc ttggggcccg acggcgtagt 7321 ggaggaagac ctgcttattc tcggggagct tcgctttgtg ggggaccagg tcccccacac 7381 cacctactac gatgggggcg tagagctgtg gcactacccc atgggacaca aatgcccacg 7441 ggtcgtgcat gtcgtcacgg tgaccgcgtg cccacgtcgc cccgccgtgg cattcgccct 7501 gtgtcgcgcg accgacagca ctcacagccc cgcatatccc accctggagc tcaatctggc 7561 ccaacagccg cttttgcggg tccagagggc aacgcgggac tatgccgggg tgtacgtgtt 7621 acgcgtatgg gtcggtgacg cgccaaacgc cagcctgttt gtcctgggga tggccatagc 7681 cgccgaaggg actctggcgt acaacggctc ggcctatggc tcctgcgacc cgaaactgct 7741 tccgtcttcg gccccgcgtc tggccccggc gagcgtatac caacccgccc ctaaccaggc 7801 ctccaccccc tcgaccacca cctccacccc ctcgaccacc atccccgctc cctcgaccac 7861 catccccgct ccccaagcat cgaccacgcc cttccccacg ggagatccaa aaccacaacc 7921 tcccggggtc aaccacgaac ccccatctaa tgccacgcga gcgacccgcg actcgcgata 7981 cgcgctaacg gtgacccaga taatccagat agccatcccc gcgtccatca tagccctggt 8041 gtttctgggg agctgtattt gctttataca cagatgtcaa cgccgctacc gacgctcccg 8101 tcgcccgatt tacagccccc agatgcccac gggcatctca tgcgcggtga acgaagcggc 8161 catggcccgc ctcggagccg agctcaaatc gcatccgagc acccccccca aatcccggcg 8221 ccggtcgtca cgcacgccaa tgccctccct gacggccatc gccgaagagt cggagcccgc 8281 tggggcggct gggcttccga cgccccccgt ggaccccacg acacccaccc caacgcctcc 8341 cctgttggta taggtccacg gccactggcc gggagcacca cataaccgac cgcagtccct 8401 gagttgggaa taaaccggta ttatttacct atatccgtgt atgtcgattt ctttcccccc 8461 ctccccggaa accaaagaag gaagcaaaga atggatggga ggagttcagg aagccgggga 8521 gagggcccgc ggcgcattta aggcgttgtt gtgttgactt 
tgcctcttct ggcgggttgg 8581 tgcggtgctg tttgttgggc tcccatttta cccgaagatc ggctgctatc cccgggacat 8641 ggatcgcggg gcggtggtgg ggtttcttct cggtgtttgt gttgtatcgt gcttggcggg 8701 aacgcccaaa acgtcctgga gacgggtgag tgtcggcgag gacgtttcgt tgcttccagc 8761 tccggggcct acggggcgcg gcccgaccca gaaactacta tgggccgtgg aacccctgga 8821 tgggtgcggc cccttacacc cgtcgtgggt ctcgctgatg ccccccaagc aggtgcccga 8881 gacggtcgtg gatgcggcgt gcatgcgcgc tccggtcccg ctggcgatgg cgtacgcccc 8941 cccggcccca tctgcgaccg ggggtctacg aacggacttc gtgtggcagg agcgcgcggc 9001 cgtggttaac cggagtctgg ttattcacgg ggtccgagag acggacagcg gcctgtatac 9061 cctgtccgtg ggcgacataa aggacccggc tcgccaagtg gcctcggtgg tcctggtggt 9121 gcaaccggcc ccagttccga ccccaccccc gaccccagcc gattacgacg aggatgacaa 9181 tgacgagggc gaggacgaaa gtctcgccgg cactcccgcc agcgggaccc cccggctccc 9241 gcctcccccc gcccccccga ggtcttggcc cagcgccccc gaagtctcac atgtgcgtgg 9301 ggtgaccgtg cgtatggaga ctccggaagc tatcctgttt tcccccgggg agacgttcag 9361 cacgaacgtc tccatccatg ccatcgccca cgacgaccag acctactcca tggacgtcgt 9421 ctggttgagg ttcgacgtgc cgacctcgtg tgccgagatg cgaatatacg aatcgtgtct 9481 gtatcacccg cagctcccag aatgtctgtc cccggccgac gcgccgtgcg ccgcgagtac 9541 gtggacgtct cgcctggccg tccgcagcta cgcggggtgt tccagaacaa accccccacc 9601 gcgctgttcg gccgaggctc acatggagcc cgtcccgggg ctggcgtggc aggcggcctc 9661 cgtcaatctg gagttccggg acgcgtcccc acaacactcc ggcctgtatc tgtgtgtggt 9721 gtacgtcaac gaccatattc acgcctgggg ccacattacc atcagcaccg cggcgcagta 9781 ccggaacgcg gtggtggaac agcccctccc acagcgcggc gcggatttgg ccgagcccac 9841 ccacccgcac gtcggggccc ctccccacgc gcccccaacc cacggcgccc tgcggttagg 9901 ggcggtgatg ggggccgccc tgctgctgtc tgcactgggg ttgtcggtgt gggcgtgtat 9961 gacctgttgg cgcaggcgtg cctggcgggc ggttaaaagc agggcctcgg gtaaggggcc 10021 cacgtacatt cgcgtggccg acagcgagct gtacgcggac tggagctcgg acagcgaggg 10081 agaacgcgac caggtcccgt ggctggcccc cccggagaga cccgactctc cctccaccaa 10141 tggatccggc tttgagatct tatcaccaac ggctccgtct gtataccccc gtagcgatgg 10201 gcatcaatct cgccgccagc tcacaacctt tggatccgga 
aggcccgatc gccgttactc 10261 ccaggcctcc gattcgtccg tcttctggta aggcgcccca tcccgaggcc ccacgtcggt 10321 cgccgaactg ggcgaccgcc ggcgaggtgg acgtcggaga cgagctaatc gcgatttccg 10381 acgaacgcgg acccccccga catgaccgcc cgcccctcgc cacgtcgacc gcgccctcgc 10441 cacacccgcg acccccgggc tacacggccg ttgtctcccc gatggccctc caggctgtcg 10501 acgccccctc cctgtttgtc gcctggctgg ccgctcggtg gctccggggg gcttccggcc 10561 tgggggcctc ctgtgtggga ttgcgtggta tgtgacgtca attgcccgag gcgcataaag 10621 ggccggtggt ccgcctagcc gcagcaaatt aaaaatcgtg agtcacagcg accgcaactt 10681 cccacccgga gctttcttcc ggcctcgatg acgtcccggc tctccgatcc caactcctca 10741 gcgcgatccg acatgtccgt gccgctttat cccacggcct cgccagtttc ggtcgaagcc 10801 tactactcgg aaagcgaaga cgaggcggcc aacgacttcc tcgtacgcat gggccgccaa 10861 cagtcggtat taaggcgtcg acgcagacgc acccgctgcg tcggcatggt gatcgcctgt 10921 ctcctcgtgg ccgttctgtc gggcggattt ggggcgctcc tgatgtggct gctccgctaa 10981 aagaccgcat cgacacgcgc gtccttcttg tcgtctctct tcccccccat caccccgcaa 11041 tttgcaccca gcctttaact acattaaatt gggttcgatt ggcaatgttg tctcccggtt 11101 gatttttggg tgggtgggga gtgggtgggt ggggagtggg tgggtgggga gtgggtgggt 11161 ggggagtggg tgggtgggga gtgggtgggt ggggagtggg tgggtgggga gtgggtgggt 11221 ggggagtggg tgggtgggga gtgggtgggt ggggagtggc aaggaagaaa caagcccgac 11281 caccagacag aaaatgtaac catacccaaa ccgactctgg gggctgtttg tggggtcgga 11341 accataggat gaacaaacca ccccgtacca cccgcaccca agggtgcggt ggctcatcgg 11401 catctgtccg gtatgggttg ttccccaccc actcgcgttc ggacgtctta gaatcatggc 11461 ggttttctat gccgacatcg gttttctccc ccgcaataag acacgatgcg ataaaatctg 11521 tttgtaaaat ttattaaggg tacaaattgc cctagcacag gggtggggtt agggccgggt 11581 ccccacaccc aaacgcacca aacagatgca ggcagtgggt cgagtacagc cccgcgtacg 11641 aacacgtcga tgcgtgtgtc agacagcacc agaaagcaca ggccatcaac aggtcgtgca 11701 tgtgtcggtg ggtttggacg cggggggcca tggtggtgat aaagttaatg gccgccgtcc 11761 gccagggcca caggggcgac gtctcttggt tggcccggag ccactgggtg tggaccagcc 11821 gcgcgtggcg gcccaacatg gcccctgtag ccgggggcgg gggatcgcgc acgtttgcag 11881 cgcacatgcg agacacctcg 
accacggttc gaaagaaggc ccggtggtcc gcgggcaaca 11941 tcaccaggtg cgcaagcgcc cgggcgtcca gagggtagag ccctgagtca tccgaggttg 12001 gctcatcgcc cgggtcttgc cgcaagtgcg tgtgggttgg gcttccggtg ggcgggacgc 12061 gaaccgcggt gtggatcccg acgcgggccc gagcgtatgc tccatgttgt ggggagaagg 12121 ggtctgggct cgccaggggg gcatacttgc ccgggctata cagacccgcg agccgtacgt 12181 ggttcgcggg gggtgcgtgg ggtccggggc tcccggggag accggggctc ccggggagac 12241 cggggctccc tgggagaccg gggttgtcgt ggatccctgg ggtcacgcgg taccctgggg 12301 tctctgggag ctcgcggtac tctgggttcc ctaggttctc ggggtggtcg cggaacccgg 12361 ggctcccggg gaacacgcgg tgtcctgggg attgttggcg gtcggacggc ttcagatggc 12421 ttcgagatcg tagtgtccgc accgactcgt agtagacccg aatctccaca ttgccccgcc 12481 gcttgatcat tatcaccccg ttgcgggggt ccggagatca tgcgcgggtg tcctcgaggt 12541 gcgtgaacac ctctggggtg catgccggcg gacggcacgc cttttaagta aacatctggg 12601 tcgcccggcc caactggggc cgggggttgg gtctggctca tctcgagagc cacggggggg 12661 aaccaccctc cgcccagaga ctcgggtgat ggtcgtaccc gggactcaac gggttaccgg 12721 attacgggga ctgtcggtca cggtcccgcc ggttcttcga tgtgccacac ccaaggatgc 12781 gttgggggcg atttcgggca gcagcccggg agagcgcagc aggggacgct ccgggtcgtg 12841 cacggcggtt ctggccgcct cccggtcctc acgccccctt ttattgatct catcgcgtac 12901 gtcggcgtac gtcctgggcc caacccgcat ggtgtccagg aaggtgtccg ccatttccag 12961 ggcccacgac atgctcccc //