GenBank-Updates@genbank.bio.net (05/27/91)
LOCUS       HUMERCC25    2872 bp ss-mRNA            PRI       27-MAY-1991
DEFINITION  Human genomic and mRNA sequence for ERCC2 gene 5'region involved in
            DNA excision repair
ACCESSION   X52221 X52470
KEYWORDS    ATP-binding protein; DNA repair; DNA-binding protein; ERCC2 gene;
            excision repair gene.
SOURCE      Homo sapiens RNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 2872)
  AUTHORS   Weber,C.A.
  JOURNAL   Unpublished (1990)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 2872)
  AUTHORS   Weber,C.A., Salazar,E.P., Stewart,S.A. and Thompson,L.H.
  TITLE     ERCC2: cDNA cloning and molecular characterisation of a human
            nucleotide excision repair gene with high homology to yeast RAD3
  JOURNAL   EMBO J. 9, 1437-1447 (1990)
  STANDARD  full automatic
COMMENT     *source: Genomic: developmental stage=adult; cell type=fibroblast;
            *source: library=p5T4-1; clone=p5T4-1-7, p5T4-1-15 and p5T4-1-28;
            **map: chromosome=19q13.2-q13.3; *source: mRNA: cell line=GM637;
            library=pcD2; clone=pER2-14; Note sequence is compiled from both
            genomic and cDNA sources. Position 1-443 is derived mainly from
            genomic clones and sequence 444-2872 is derived from cDNA clones.
            See <X52222> for 3'end of ERCC2.
            From EMBL entry HSERCC25; dated 24-JUL-1990.
FEATURES             Location/Qualifiers
     misc_feature    100..133
                     /note="pyrimidine-rich region"
     misc_feature    191..196
                     /note="GC box"
     repeat_unit     208..212
                     /note="inverted repeat A"
     repeat_unit     215..219
                     /note="inverted repeat A'"
     promoter        245..253
                     /note="CAAT box"
     promoter        276..282
                     /note="TATA box"
     precursor_RNA   301..2872
                     /note="transcript"
     mRNA            301..383
                     /note="exon 1"
     misc_feature    337..353
                     /note="minisatellite DNA"
     CDS             379..383
                     /note="ERCC2 gene product (AA 1-2) (383 is 2nd base in
                     codon)"
                     /codon_start=379
     misc_feature    383..409
                     /note="partial minisatellite DNA"
     intron          384..686
                     /note="intron I"
     misc_feature    414..437
                     /note="partial minisatellite DNA"
     misc_feature    626..663
                     /note="minisatellite DNA"
     misc_feature    664..693
                     /note="minisatellite DNA"
     mRNA            687..>686
                     /note="exon 2"
     CDS             687..>2872
                     /note="ERCC2 gene product (AA 3-730) (687 is 3rd base in
                     codon) (2872 is 1st base in codon)"
                     /codon_start=687
BASE COUNT 560 a 888 c 871 g 553 t
ORIGIN
1 gctatcttgc tcaagctgat ctcgaactcc tgggttcgat caatactcag acaatcttgg
61 caggcgcagg aggaccaaat tctagtgaat gagatcgagt ctctcggctc tttcccttcc
121 atgttttctt tttgattggc cctcgacgat cctcagtgac gcctcccgca ccgcctcacc
181 cgagagtcag ccgccctcgc ttttccgtgc gcacgcgcag tatcccgatt ggctctgccc
241 tagcggattg acgggcaggt tagccaatgg tctcgtaata taggtggagc gagccctcga
301 ggatgtccac gacccggcct ctcgctgaat attcatgagg gaggcgggtc gaccccgctg
361 cacagtccgg ccggcgccat gaagtgagaa gggggctggg ggtcgcgctc gctagcgggc
421 gcggggggtc ttgaagatgg ggtcatcggt gggcgcgcct gggtccccaa gggggcgagg
481 ggagggtgaa ggggtgggac gggggcagcc gcagggagca gcagtgatag cgaggagaca
541 ctgagggggc cccgaggctc ctgaggacct gagggttacc gggggcgccg ggcccgtcac
601 ccttctctgg gctcgacgac cgggcactgt ggaggcggga gaggggctga ggggacggga
661 actgacccag cagcccctgc cgccaggctc aacgtggacg ggctcctggt ctacttcccg
721 tacgactaca tctaccccga gcagttctcc tacatgcggg agctcaaacg cacgctggac
781 gccaagggtc atggagtcct ggagatgccc tcaggcaccg ggaagacagt atccctgttg
841 gccctgatca tggcatacca gagagcatat ccgctggagg tgaccaaact catctactgc
901 tcaagaactg tgccagagat tgagaaggtg attgaagagc ttcgaaagtt gctcaacttc
961 tatgagaagc aggagggcga gaagctgccg tttctgggac tggctctgag ctcccgcaaa
1021 aacttgtgta ttcaccctga ggtgacaccc ctgcgctttg ggaaggacgt cgatgggaaa
1081 tgccacagcc tcacagcctc ctatgtgcgg gcgcagtacc agcatgacac cagcctgccc
1141 cactgccgat tctatgagga atttgatgcc catgggcgtg aggtgcccct ccccgctggc
1201 atctacaacc tggatgacct gaaggccctg gggcggcgcc agggctggtg cccatacttc
1261 cttgctcgat actcaatcct gcatgccaat gtggtggttt atagctacca ctacctcctg
1321 gaccccaaga ttgcagacct ggtgtccaag gaactggccc gcaaggccgt cgtggtcttc
1381 gacgaggccc acaacattga caacgtctgc atcgactcca tgagcgtcaa cctcacccgc
1441 cggacccttg accggtgcca gggcaacctg gagaccctgc agaagacggt gctcaggatc
1501 aaagagacag acgagcagcg cctgcgggac gagtaccggc gtctggtgga ggggctgcgg
1561 gaggccagcg ccgcccggga gacggacgcc cacctggcca accccgtgct gcccgacgaa
1621 gtgctgcagg aggcagtgcc tggctccatc cgcacggccg agcatttcct gggcttcctg
1681 aggcggctgc tggagtacgt gaagtggcgg ctgcgtgtgc agcatgtggt gcaggagagc
1741 ccgcccgcct tcctgagcgg cctggcccag cgcgtgtgca tccagcgcaa gcccctcaga
1801 ttctgtgctg aacgcctccg gtccctgctg catactctgg agatcaccga ccttgctgac
1861 ttctccccgc tcaccctcct tgctaacttt gccacccttg tcagcaccta cgccaaaggc
1921 ttcaccatca tcatcgagcc ctttgacgac agaaccccga ccattgccaa ccccatcctg
1981 cacttcagct gcatggacgc ctcgctggcc atcaaacccg tatttgagcg tttccagtct
2041 gtcatcatca catctgggac actgtccccg ctggacatct accccaagat cctggacttc
2101 caccccgtca ccatggcaac cttcaccatg acgctggcac gggtctgcct ctgccctatg
2161 atcatcggcc gtggcaatga ccaggtggcc atcagctcca aatttgagac ccgggaggat
2221 attgctgtga tccggaacta tgggaacctc ctgctggaga tgtccgctgt ggtccctgat
2281 ggcatcgtgg ccttcttcac cagctaccag tacatggaga gcaccgtggc ctcctggtat
2341 gagcagggga tccttgagaa catccagagg aacaagctgc tctttattga gacccaggat
2401 ggtgccgaaa ccagtgtcgc cctggagaag taccaggagg cctgcgagaa tggccgcggg
2461 gccatcctgc tgtcagtggc ccggggcaaa gtgtccgagg gaatcgactt tgtgcaccac
2521 tacgggcggg ccgtcatcat gtttggcgtc ccctacgtct acacacagag ccgcattctc
2581 aaggcgcggc tggaatacct gcgggaccag ttccagattc gtgagaatga ctttcttacc
2641 ttcgatgcca tgcgccacgc ggcccagtgt gtgggtcggg ccatcagggg caagacggac
2701 tacggcctca tggtctttgc cgacaagcgg tttgcccgtg gggacaagcg ggggaagctg
2761 ccccgctgga tccaggagca cctcacagat gccaacctca acctgaccgt ggacgagggt
2821 gtccaggtgg ccaagtactt cctgcggcag atggcacagc ccttccaccg gg
//
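
The codon notes in the two CDS features and the BASE COUNT line can be cross-checked with simple arithmetic. The Python sketch below is not part of the GenBank entry. It assumes the two CDS spans are spliced together directly (379..383 followed by 687..2872), and the helper names span_length and codon_position are hypothetical, introduced only for illustration.

```python
# Coordinates copied from the FEATURES table of the entry (1-based, inclusive).
CDS_EXON1 = (379, 383)    # annotated as AA 1-2; 383 is the 2nd base in its codon
CDS_EXON2 = (687, 2872)   # annotated as AA 3-730; 687 is the 3rd base in its codon

# Counts copied from the BASE COUNT line.
BASE_COUNT = {"a": 560, "c": 888, "g": 871, "t": 553}
SEQUENCE_LENGTH = 2872    # from the LOCUS line


def span_length(start, end):
    """Number of bases in an inclusive 1-based span."""
    return end - start + 1


def codon_position(offset):
    """Map a 1-based offset into the spliced CDS to (codon number, base in codon)."""
    return (offset - 1) // 3 + 1, (offset - 1) % 3 + 1


exon1_len = span_length(*CDS_EXON1)
spliced_len = exon1_len + span_length(*CDS_EXON2)

last_base_exon1 = codon_position(exon1_len)       # corresponds to base 383
first_base_exon2 = codon_position(exon1_len + 1)  # corresponds to base 687
last_base_entry = codon_position(spliced_len)     # corresponds to base 2872

print("base 383 :", last_base_exon1)
print("base 687 :", first_base_exon2)
print("base 2872:", last_base_entry)
print("base count total:", sum(BASE_COUNT.values()))
```

Under the splicing assumption above, base 383 falls on the 2nd base of codon 2, base 687 on the 3rd base of codon 2, and base 2872 on the 1st base of codon 731, so the last complete codon is codon 730. This agrees with the feature notes. The four base counts also sum to the 2872 bp given on the LOCUS line.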