GenBank-Updates@genbank.bio.net (05/30/91)
LOCUS NEMA2C4A 5909 bp ss-mRNA INV 30-MAY-1991
DEFINITION A.suum alpha-2 (IV) collagen mRNA, complete cds.
ACCESSION M67507
KEYWORDS alpha-2 (IV) collagen.
SOURCE Ascaris suum () Adult Ovarian tissue cDNA to mRNA.
ORGANISM Ascaris suum
Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia;
Ascaridida; Ascaridina; Ascaridoidea; Ascarididae.
REFERENCE 1 (bases 1 to 5909)
AUTHORS Pettitt,J. and Kingston,I.I.
TITLE The complete primary structure of a nematode alpha 2(IV) collagen
and the partial structural organization of its gene
JOURNAL J. Biol. Chem. (1991) In press
STANDARD full automatic
FEATURES Location/Qualifiers
5'UTR 1..162
CDS 163..5451
/gene="alpha-2 (IV) collagen"
/codon_start=163
sig_peptide 163..240
/codon_start=163
misc_feature 241..640
/function="'7S domain'"
misc_signal 538..546
/function="'glycosylation site'"
misc_feature 641..4749
/function="'triple-helical region'"
misc_feature 4750..5451
/function="'NC1 domain'"
3'UTR 5452..5909
polyA_signal 5856..5861
BASE COUNT 1418 a 1528 c 1821 g 1142 t
ORIGIN
1 tgctgcttgc ttctagatcg gcggcaagca ccgcttcgtt taaaggcgag aagcatcgac
61 cactcagagc agaagtttgt gcggcacgcc cacagcgcag ttagctcctc gactgactgc
121 acttcaggct gaccgtgtgc tgctctgagc cgcctcgtca gcatgagttc gcggctcaga
181 ataccgttat ggctattatt acccacaact gccctcgttt attttgttac gacagttagt
241 acgcaaataa catgtcgaga ttgcactaat agaggatgct tctgcgttgg cgagaaagga
301 agtatgggaa ttcccggtcc tcaaggccct cctggtgcac aaggcatccg cggattccca
361 ggtcccgagg gccttccagg acctaaaggc cagaaaggat cccagggacc acctggacca
421 caaggaatca aaggagaccg gggaataatt ggtgtgcctg gttttcctgg taatgatgga
481 gcgaatggac gtcccggaga gcctgggccg cccggtgccc caggatggga tggttgcaac
541 ggaacggatg gtgcgcccgg tgtacctggt ttgcctggac cacctggaat gccaggattt
601 cccggccctc caggtgttcc cggaatgaaa ggagaaccgg ctattggata tgcaggagct
661 cctggagaga agggtgacgc tggtatgcct ggtatgcccg gtttgcctgg cccgcccgga
721 agagatggat ttcccggaga gaaaggcgac cgaggagacg ttggacaagc cggacctcga
781 ggaccaccag gtgaagctgg accacccgga aatccaggta tcggaagtat tggaccgaaa
841 ggagaccccg gagagcaagg acctcgcggt ccacaaggtc cacctggccc agttccatca
901 actggcgcca aaggcactat cattggtcca gagggtgccc ctggaatgaa gggagaaaag
961 ggcgatccgg gagaagcagg tccaagagga ttccctggga caccaggagt ggctgggcag
1021 ccaggacttc ctggaatgaa aggagaaaag ggtctttcgg gacctgccgg gccaaggggg
1081 aaagaaggac ggcctggttt accgggtcca ccaggattta aaggcgatcg tggtctcgat
1141 ggactacccg gtgtgcctgg tttgcctggt caaaaaggtg aggctggatt cccaggaaga
1201 gatggagcaa aaggagcgag aggaccgcca ggaccaccgg gaggtggaga gttttctgat
1261 gggccaccgg gacctcctgg actgcctgga cgtgagggac aacccggtcc gccaggagca
1321 gatggatatc caggcccacc tggaccgcag ggaccgcaag gtctgccggg tggaccaggt
1381 cttcctggac ttcctggtct tgaaggtttg ccgggaccga aaggagaaaa aggtgattct
1441 ggaatccccg gtgcccctgg tgttcaagga cctcctggac tagctggacc gcctggcgca
1501 aaaggtgaac ccggaccacg aggtgtagat ggacaaagta ttcctggtct gccaggaaag
1561 gatggaaggc ccggactgga cggtctcccc ggaagaaaag gagaaatggg actgcctggc
1621 gtacgaggcc cacccggaga ctccttgaat ggacttcctg gaccacctgg accacgcggt
1681 cctcaaggtc ccaaaggtta tgatggacgc gacggcgctc ccggtctacc aggtattcca
1741 ggccctaaag gcgatcgtgg aggaacatgc gcgttttgcg cccatggagc aaagggagaa
1801 aaaggtgacg ctggatatgc tggtttgccc gggcctcagg gtgagcgcgg cctgccaggt
1861 attcccggag caactggtgc gccaggtgac gatggacttc ctggagcgcc tggacgtccc
1921 ggtcccccag gcccacctgg acaagacggc ttaccaggcc ttcctgggca aaaaggagaa
1981 ccaacacagc tcacccttcg gcctggccct ccagggtacc ctggacaaaa gggtgaaacc
2041 ggcttccctg gaccgcgcgg acaagaagga ttacctggaa aacctggaat tgttggtgcg
2101 cccggattgc ctgggccacc gggaccaaag ggcgagccgg gactaactgg tttgcccgaa
2161 aaaccaggaa aggacggaat ccccggcctg ccgggcttga aaggagaacc tggatatggg
2221 caaccgggaa tgcccggttt gcctggaatg aaaggagacg ctggcttacc tggattgccc
2281 ggtttgcctg gtgcagtggg acctatgggg ccgccagtcc ctgagagtca gctaaggcca
2341 gggccgccag gaaaggatgg attgccaggc ttgccagggc ccaagggcga agctggattc
2401 cctggggcac ctggtttgca aggcccagcc ggattgcctg gattgcctgg aatgaaagga
2461 aatccaggcc taccaggagc tccgggtctc gctggacttc ctggaatacc tggagaaaaa
2521 gggatcgcag gaaagcctgg gcttcctggg cttactggag ctaaaggaga agctggatat
2581 cctggacagc cgggtctacc tggtccgaag ggagaaccag ggccatcaac aactggacca
2641 cccggacctc caggattccc tggacttaaa ggaaaggacg gaattcccgg tgccccagga
2701 ttgcctggtc tcgaaggaca gcgcggttta ccaggagtcc caggacaaaa gggagagatt
2761 ggccttccag ggcttgctgg tgcacctggt ttcccaggag ctaaaggaga gcctggatta
2821 cctggcctgc caggcaaaga gggaccgcaa ggaccaccag gacaaccggg agcgccaggt
2881 ttccctggtc aaaaaggtga tgaaggtctc cctggtctgc cgggagtttc tggaatgaaa
2941 ggcgatacag gtctcccagg tgttcctggg cttgctggac cccccggaca accaggattt
3001 ccaggacaga aaggacagcc agggttccct ggagttgccg gagctaaagg tgaagctggc
3061 ttgcctggat tacctggcgc acctggtcaa aagggagaac aaggattggc agggcttcct
3121 ggcataccag gaatgaaagg agcccctggg atcccaggag cgccaggtca agatggtctt
3181 cctggtttgc cgggagtgaa aggagatcga ggatttaacg gactgccggg tgagaaggga
3241 gagccgggtc cagcagcacg agatggcgaa aaaggagagc caggcttgcc tggccaaccg
3301 ggtcttcgcg gaccacaagg gcctcctggt ctccctggtt tgcctggatt gaaaggagat
3361 gagggacaac ccggatacgg agcaccaggc ttgatgggtg agaagggtct cccgggtctg
3421 cctgggaaac ctgggcgacc aggcgctcca ggaccgaaag gactcgatgg tgcgccaggc
3481 tttcccggcc ttaagggtga agcgggtttg cctggagcac ccggattacc cgggcaagat
3541 ggactaccgg gactgcctgg tcagaaaggt gaaagtggat tccctggcca acctggcctt
3601 gttgggcctc caggtctgcc aggtaagatg ggcgctcccg gcattcgtgg agagaaagga
3661 gacgctggac tgcctggact tcctggtgaa cgtggtcttg atggtctccc aggccagaaa
3721 ggagaagcag gcttccctgg ggctcccggt ctccctgggc ctgttgggcc taagggcagt
3781 gctggtgcac ctggtttccc cggtttgaaa ggtgaacctg gccttccagg tctagaagga
3841 caacccggac cacgtggaat gaaaggagaa gctggattac ccggtgctcc tggaagagac
3901 ggtctgccgg gcttacctgg catgaaggga gaagcaggac ttcctggcct gccagggcaa
3961 ccaggaaaat cgatcactgg tccgaaaggt aacgctgggc tccccggact gccaggaaaa
4021 gatggcctgc ctggcttacc aggtcttaaa ggtgaacctg gaaagccagg atatgcaggc
4081 gcagctggaa taaaaggaga acctggactg cctggaattc caggcgcgaa aggtgaacct
4141 ggtttatcag gcataccagg caagcgagga aacgatggaa taccaggaaa accaggcccc
4201 gcaggactac caggcctccc aggaatgaag ggtgaaagcg gcttgccagg accgcaaggg
4261 ccagctggac tgcccggttt gcctggtctc aaaggagaac caggtttacc tggtttccca
4321 ggacagaaag gagaaactgg attccctgga caacctggaa tccctggcct cccgggcatg
4381 aagggtgact ctggttaccc tggagcacca ggaagagatg gagcacctgg caaacaagga
4441 gaaccaggac cgatgggacc tccaggtgca cagccaattg ttcagagagg tgagaaaggt
4501 gagatgggac cgatgggcgc gccaggtatt cgaggcgaga agggattgcc tggtcttgac
4561 ggccttccag gaccaagcgg tccacctgga tttgcgggcg ctaaaggccg ggacggcttc
4621 cctggacagc cgggtatgcc aggagagaaa ggtgcgccag ggcttcctgg atttccgggc
4681 attgaaggca tccctggccc accaggtctc ccaggaccta gtggaccacc aggaccaccg
4741 ggtccatctt acaaggatgg attcctgctt gtgaagcaca gtcagacttc agaagtacca
4801 caatgtccgc cgggaatggt gaaactttgg gatggctact cattgcttta catcgaagga
4861 aacgagaaat ctcacaacca ggatctcgga cacgcgggtt cgtgcttgtc gcgattctcg
4921 acaatgccat tcctattctg tgacgtgaac aacgtgtgca attatgcatc tcgcaacgac
4981 aaatcctatt ggttgtcaac aacggcgccc attccaatga tgcctgtcag tgagggtggc
5041 attgaaccat acatctcaag atgcgcagta tgcgaagcgc cagccaatgt aatcgccgtc
5101 cattcgcaga cgattcagat cccgaactgc cccaacggat ggaattctct ttggattggt
5161 tattcgttcg cgatgcacac gggtgctggc gcagaaggag gaggacagtc gctcagttca
5221 cctggatcat gtctagaaga cttccgcgct actccgttca tcgaatgcaa tggcgcacgt
5281 ggtacctgtc attacttcgc caataaattc agtttctggc tgacgacaat tgaggatgac
5341 caacaattca ggattccgga aagtgagacg ctgaaagcag gtagcttacg cacacgggtt
5401 tcgcgatgcc aggtctgcat cagatcacca gacgtgcaac cgtatcgagg atgacgccta
5461 tccactttgc acttcagcct atctatgagc gtatttagct gctacggcaa cgttgaattt
5521 ttcttttttt ttgctaaaac tgccataaca actaaactca tattgaaatt attcgttcgc
5581 ctaaggggtt tgcttgctgc agttgtctcc cctgtttcta tttttttctt tttttttgta
5641 tagaagttaa gcctagaatg ggcagcaaac agccgcaaag gcacttcgat acgtgccaaa
5701 atggagcact ttgttaagaa actgcccggc aacgctgtct cattggcgat tcatttcctt
5761 cgccgaaact ccttgtgata atagcaactg cctgttggtg ctcgcctctc tttcgtgcat
5821 tttgttctcc ttcaatttct tacataaagt gtttcaataa agtttacttt aaaaaaaaaa
5881 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//