GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS HUMERCC25 2872 bp ss-mRNA PRI 27-MAY-1991 DEFINITION Human genomic and mRNA sequence for ERCC2 gene 5'region involved in DNA excision repair ACCESSION X52221 X52470 KEYWORDS ATP-binding protein; DNA repair; DNA-binding protein; ERCC2 gene; excision repair gene. SOURCE Homo sapiens RNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2872) AUTHORS Weber,C.A. JOURNAL Unpublished (1990) STANDARD full automatic REFERENCE 2 (bases 1 to 2872) AUTHORS Weber,C.A., Salazar,E.P., Stewart,S.A. and Thompson,L.H. TITLE ERCC2: cDNA cloning and molecular characterisation of a human nucleotide excision repair gene with high homology to yeast RAD3 JOURNAL EMBO J. 9, 1437-1447 (1990) STANDARD full automatic COMMENT *source: Genomic: developmental stage=adult; cell type=fibroblast; *source: library=p5T4-1; clone=p5T4-1-7, p5T4-1-15 and p5T4-1-28; **map: chromosome=19q13.2-q13.3; *source: mRNA: cell line=GM637; library=pcD2; clone=pER2-14; Note sequence is compiled from both genomic and cDNA sources. Position 1-443 is derived mainly from genomic clones and sequence 444-2872 is derived from cDNA clones. See <X52222> for 3'end of ERCC2. From EMBL entry HSERCC25; dated 24-JUL-1990. FEATURES Location/Qualifiers misc_feature 100..133 /note="pyrimidine-rich region" misc_feature 191..196 /note="GC box" repeat_unit 208..212 /note="inverted repeat A" repeat_unit 215..219 /note="inverted repeat A'" promoter 245..253 /note="CAAT box" promoter 276..282 /note="TATA box" precursor_RNA 301..2872 /note="transcript" mRNA 301..383 /note="exon 1" misc_feature 337..353 /note="minisatellite DNA" CDS 379..383 /note="ERCC2 gene product (AA 1-2) (383 is 2nd base in codon)" /codon_start=379 misc_feature 383..409 /note="partial minisatellite DNA" intron 384..686 /note="intron I" misc_feature 414..437 /note="partial minisatellite DNA" misc_feature 626..663 /note="minisatellite DNA" misc_feature 664..693 /note="minisatellite DNA" mRNA 687..>686 /note="exon 2" CDS 687..>2872 /note="ERCC2 gene product (AA 3-730) (687 is 3rd base in codon) (2872 is 1st base in codon)" /codon_start=687 BASE COUNT 560 a 888 c 871 g 553 t ORIGIN 1 gctatcttgc tcaagctgat ctcgaactcc tgggttcgat caatactcag acaatcttgg 61 caggcgcagg aggaccaaat tctagtgaat gagatcgagt ctctcggctc tttcccttcc 121 atgttttctt tttgattggc cctcgacgat cctcagtgac gcctcccgca ccgcctcacc 181 cgagagtcag ccgccctcgc ttttccgtgc gcacgcgcag tatcccgatt ggctctgccc 241 tagcggattg acgggcaggt tagccaatgg tctcgtaata taggtggagc gagccctcga 301 ggatgtccac gacccggcct ctcgctgaat attcatgagg gaggcgggtc gaccccgctg 361 cacagtccgg ccggcgccat gaagtgagaa gggggctggg ggtcgcgctc gctagcgggc 421 gcggggggtc ttgaagatgg ggtcatcggt gggcgcgcct gggtccccaa gggggcgagg 481 ggagggtgaa ggggtgggac gggggcagcc gcagggagca gcagtgatag cgaggagaca 541 ctgagggggc cccgaggctc ctgaggacct gagggttacc gggggcgccg ggcccgtcac 601 ccttctctgg gctcgacgac cgggcactgt ggaggcggga gaggggctga ggggacggga 661 actgacccag cagcccctgc cgccaggctc aacgtggacg ggctcctggt ctacttcccg 721 tacgactaca tctaccccga gcagttctcc tacatgcggg agctcaaacg cacgctggac 781 gccaagggtc atggagtcct ggagatgccc tcaggcaccg ggaagacagt atccctgttg 841 gccctgatca tggcatacca gagagcatat ccgctggagg tgaccaaact catctactgc 901 tcaagaactg tgccagagat tgagaaggtg attgaagagc ttcgaaagtt gctcaacttc 961 tatgagaagc aggagggcga gaagctgccg tttctgggac tggctctgag ctcccgcaaa 1021 aacttgtgta ttcaccctga ggtgacaccc ctgcgctttg ggaaggacgt cgatgggaaa 1081 tgccacagcc tcacagcctc ctatgtgcgg gcgcagtacc agcatgacac cagcctgccc 1141 cactgccgat tctatgagga atttgatgcc catgggcgtg aggtgcccct ccccgctggc 1201 atctacaacc tggatgacct gaaggccctg gggcggcgcc agggctggtg cccatacttc 1261 cttgctcgat actcaatcct gcatgccaat gtggtggttt atagctacca ctacctcctg 1321 gaccccaaga ttgcagacct ggtgtccaag gaactggccc gcaaggccgt cgtggtcttc 1381 gacgaggccc acaacattga caacgtctgc atcgactcca tgagcgtcaa cctcacccgc 1441 cggacccttg accggtgcca gggcaacctg gagaccctgc agaagacggt gctcaggatc 1501 aaagagacag acgagcagcg cctgcgggac gagtaccggc gtctggtgga ggggctgcgg 1561 gaggccagcg ccgcccggga gacggacgcc cacctggcca accccgtgct gcccgacgaa 1621 gtgctgcagg aggcagtgcc tggctccatc cgcacggccg agcatttcct gggcttcctg 1681 aggcggctgc tggagtacgt gaagtggcgg ctgcgtgtgc agcatgtggt gcaggagagc 1741 ccgcccgcct tcctgagcgg cctggcccag cgcgtgtgca tccagcgcaa gcccctcaga 1801 ttctgtgctg aacgcctccg gtccctgctg catactctgg agatcaccga ccttgctgac 1861 ttctccccgc tcaccctcct tgctaacttt gccacccttg tcagcaccta cgccaaaggc 1921 ttcaccatca tcatcgagcc ctttgacgac agaaccccga ccattgccaa ccccatcctg 1981 cacttcagct gcatggacgc ctcgctggcc atcaaacccg tatttgagcg tttccagtct 2041 gtcatcatca catctgggac actgtccccg ctggacatct accccaagat cctggacttc 2101 caccccgtca ccatggcaac cttcaccatg acgctggcac gggtctgcct ctgccctatg 2161 atcatcggcc gtggcaatga ccaggtggcc atcagctcca aatttgagac ccgggaggat 2221 attgctgtga tccggaacta tgggaacctc ctgctggaga tgtccgctgt ggtccctgat 2281 ggcatcgtgg ccttcttcac cagctaccag tacatggaga gcaccgtggc ctcctggtat 2341 gagcagggga tccttgagaa catccagagg aacaagctgc tctttattga gacccaggat 2401 ggtgccgaaa ccagtgtcgc cctggagaag taccaggagg cctgcgagaa tggccgcggg 2461 gccatcctgc tgtcagtggc ccggggcaaa gtgtccgagg gaatcgactt tgtgcaccac 2521 tacgggcggg ccgtcatcat gtttggcgtc ccctacgtct acacacagag ccgcattctc 2581 aaggcgcggc tggaatacct gcgggaccag ttccagattc gtgagaatga ctttcttacc 2641 ttcgatgcca tgcgccacgc ggcccagtgt gtgggtcggg ccatcagggg caagacggac 2701 tacggcctca tggtctttgc cgacaagcgg tttgcccgtg gggacaagcg ggggaagctg 2761 ccccgctgga tccaggagca cctcacagat gccaacctca acctgaccgt ggacgagggt 2821 gtccaggtgg ccaagtactt cctgcggcag atggcacagc ccttccaccg gg //