GenBank-Updates@genbank.bio.net (05/22/91)
LOCUS HUMCOLL 2452 bp ss-mRNA PRI 22-MAY-1991
DEFINITION Human mRNA encoding Pro-alpha-2 chain of type I procollagen (major
part).
ACCESSION V00503 J00115
KEYWORDS collagen; complementary DNA.
SOURCE Homo sapiens RNA.
ORGANISM Homo sapiens
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE 1 (bases 1 to 2452)
AUTHORS Bernard,M.P., Myers,J.C., Chu,M.L., Ramirez,F., Eikenberry,E.F. and
Prockop,D.J.
TITLE Structure of a cDNA for the Pro-alpha-2 chain of human type I
procollagen. Comparison with chick cDNA for Pro-alpha-2(I)
identifies structurally conserved features of the protein and the
gene
JOURNAL Biochemistry 22, 1139-1145 (1983)
STANDARD full automatic
COMMENT SWISS-PROT; P08123; CA21$HUMAN.
Data kindly reviewed (24-MAY-1983) by M.P. Bernard.
From EMBL 26 entry HSCOLL; dated 28-MAR-1991.
FEATURES Location/Qualifiers
CDS <1..2221
/note="procollagen (1 is 3rd base in codon)"
/codon_start=2
BASE COUNT 511 a 631 c 723 g 581 t 6 others
ORIGIN
1 ggaacctggt gtggttggtg ctgtgggcac tgctggtcca tctggtccta gtggactccc
61 aggagagagg ggtgctgctg gcatacctgg aggcaaggga gaaaagggtg aacctggtct
121 cagaggtgaa attggtaacc ctggcagaga tggtgctcgt ggtgctcatg gtgctgtagg
181 tgcccctggt cctgctggag ccacaggtga ccggggcgaa gctggggctg ctggtcctgc
241 tggtcctgct ggtcctcggg gaagccctgg tgaacgtggc gaggtcggtc ctgctggccc
301 caacggattt gctggtccgg ctggtgctgc tggtcaaccg ggtgctaaag gagaaagagg
361 agccaaaggg cctaagggtg aaaacggtgt tgttggtccc acaggccccg ttggagctgc
421 tggcccnnnn ggtccaaatg gtccccccgg tcctgctgga agtcgtggtg atggaggccc
481 ccctggtatg actggtttcc ctggtgctgc tggacggact ggtcccccag gaccctctgg
541 tatttctggc cctcctggtc cccctggtcc tgctgggaaa gaagggcttc gtggaccncg
601 aggngaccaa ggaccagcag gccgacctgg agaagtagga gcaccgggtc cccctggctt
661 cgctggtgag aagggtccct ctggagaggc tggtactgct ggacctcctg gcactccagg
721 tcctcagggt cttcttggtg ctcctggtat tctgggtctc cctggctcga gaggtgaacg
781 tggtctacct ggtgttgctg gtgctgtggg tgaacctggt cctcttggca ttgccggccc
841 tcctggggcc cgtggtcctc ctggtgctgt gggtagtcct ggagtcaacg gtgctcctgg
901 tgaagctggt cgtgatggca accctgggaa cgatggtccc ccaggtcgcg atggtcaacc
961 cggacacaag ggagagcgcg gttaccctgg caatattggt cccgttggtg ctgcaggtgc
1021 acctggtcct catggccccg tgggtcctgc tggcaaacat ggaaaccgtg gtgaaactgg
1081 tccttctggt cctgttggtc ctgctggtgc tgttggccca agaggtccta gtggcccaca
1141 aggcattcgt ggcgataagg gagagcccgg tgaaaagggg cccagaggtc ttcctggctt
1201 caagggacac aatggattgc aaggtctgcc tggtatcgct ggtcaccatg gtgatcaagg
1261 tgctcctggc tccgtgggtc ctgctggtcc taggggccct gctggtcctt ctggccctgc
1321 tggaaaagat ggtcgcactg gacatcctgg tacggttgga cctgctggca ttcgaggccc
1381 tcagggtcac caaggccctg ctggcccccc tggtccccct ggccctcttg gacctctagg
1441 tgtaagcggt ggtggttatg actttggtta cgatggagac ttctacaggg ctgaccagcc
1501 ttctctcaga cccaaggact atgaagttga tgctactctg aagtctctca acaaccagat
1561 tgagaccctt cttactcctg aaggctctag aaagaaccca gctcgcacat gccgtgactt
1621 gagactcagc cacccagagt ggagcagcgg ttactactgg attgacccca accaaggatg
1681 cactatggaa gccatcaaag tatactgtga tttccctacc ggcgaaacct gtatccgggc
1741 ccaacctgaa aacatcccag ccaagaactg gtataggagc tccaaggaca agaaacacgt
1801 ctggctagga gaaactatca atgctggcag ccagtttgaa tataatgttg aaggagtgac
1861 ttccaaggaa atggctaccc aacttgcctt catgcgcctg ctggccaact atgcctctca
1921 gaacatcacc taccactgca agaacagcat tgcatacatg gatgaggaga ctggcaacct
1981 gaaaaaggct gtcattctac agggctctaa tgatgttgaa cttgttgctg agggcaacag
2041 caggttcact tacactgttc ttgtagatgg ctgctctaaa aagacaaatg aatggggaaa
2101 gacaatcatt gaatacaaaa caaataagcc atcacgcctg cccttccttg atattgcacc
2161 tttggacatc ggtggtgctg accatgaatt ctttgtggac attggcccag tctgtttcaa
2221 ataaatgaac taaaattaac ttaaaggccc ccccctcaga attattcttt gtcatttctt
2281 tttgtaatga gagctgactc cttccatttt ttttctgttc atctacttgc ttaaactctg
2341 ggcgaaagag aaggagaaga attgattgga gcattgtgca atgaaattta atacagcccc
2401 aaaaggactt ggaagtcttt caagatttaa caccttgctt tgggaaatgt ca
//